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SIR GODFREY THOMSON 


„ Readers will be grieved to learn that Sir Godfrey Thomson, for many years one of the 
editors of this Journal, died in hospital on February 9th. He had long been regarded as 
the most eminent living representative of Scottish education °. Though not himself a Scot, 
he was born at Carlisle, not far from the Scottish border, on 27 March 1881. For the earlier 
Part of his life his home was on Tyneside. But in 1925 he crossed the border, and the re- 
mainder of his vears were spent in Edinburgh. 

In his autobiography he relates how he * went to the local Board School, and won a 
Scholarship which provided books and free tuition at Rutherford College in Newcastle-on- 
Tyne’. At the age of sixteen he returned to his old school as a pupil teacher * with a salary 
of £15 for the first year ’.1 Two years later he won a * Queen's scholarship ° which gave 
Pupil teachers free tuition for a degree; and chose the Durham College of Science. In 1903 
he obtained the B.Sc. with distinction in Physics and Mathematics. He then took a post as 
Physics master in a Roman Catholic School, but shortly afterwards obtained another scholar- 
Ship which enabled him to study in Germany. At that time he believed that his own future 
lay in the field of wireless telegraphy. He therefore decided to go to Strasburg, where the 

oremost German expert in the subject, Ferdinand Braun, held the Chair of Physics. In 1906 
€ received the doctorate summa cum laude for a thesis Über den Durchgang Hertzscher 
Wellen durch Gitter. 

The conditions attached to the Queen's scholarship required holders, after due quali- 
fication, to teach for nine years * in an Elementary School, the Army, the Navy, or the 
Workhouse °, Accordingly Thomson returned to Armstrong College at Newcastle to 
fake up a post as lecturer to students intending to teach in elementary schools, which was 
accepted as equivalent to teaching there oneself. Among other courses he now had to 
deliver lectures on educational psychology. For this he determined to equip himself by 

„Spending the summer vacation under C. S. Myers in the Psychological Laboratory at 
Cambridge, where I myself first had the pleasure of meeting him. He relates how Myers 
One day brought me William Brown’s little book, The Essentials of Mental Measurement? 
(to which Thomson himself later contributed several chapters), and how it aroused his interest 
In psychological methods. As a result he and his wife spent several months collecting 
ata on weight-discrimination ; and in 1912 he submitted the results to the Editor of the 
British Journal of Psychology. The manuscript, he tells us, was first returned with a criticism 
Tom the referee, saying “the writer does not seem to know of F. M. Urban's work’. 
Owever, with warm encouragement from Myers, who had already decided that “ the 
Young man from the North was one of his ablest students ", he set himself to study, and 
Eventually to correct Urban's formula, “ thus placing the ‘constant process’ on a basis 
Which made it the precursor of the modern * probit method ° ". 

He next turned his attention to the second part of Brown's book. From 1904 onwards, 
Under the influence of McDougall at Oxford, Brown, Spearman, and I had been trying to 
&PPly Pearson’s new method of correlation to measurements obtained from mental tests ; 

ut as Brown reports, our conclusions were widely different: whereas ] found both 
general and group factors, Spearman found only a general factor, and Brown found nothing 
but Sroup factors. Spearman replied to Brown's criticisms in a long paper, written jointly 
ИМ Hart, in which, as Thomson puts it, he made “ sweeping claims for the theory that ail 
Correlations bétween psychological tests were due to only one factor, =”, Thomson had the 
1арру idea of representing the rival hypotheses by patterns of dice, and was thus able to show 

wait was possible to make a set of artificial test scores, without any general factor, which 
Tq theless gave correlations fully satisfying Spearman's criterion for the Two-Factor theory". 
D € account was published in 1916, and formed the first of a long series of highly original 

apers on factorial techniques. In his own opinion his most important discovery was his 


1A History of Psychology in Autobiography, Vol. IV (H. S. Langfield, ed., 1952). 
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proof that “the random interplay of a large number of small independent influences will 
produce a matrix of correlations which can be reduced to a low rank, or even to a rank of 
unity ” ; and on this he based his celebrated * sampling theory `. 

After the first world war was over, Pearson offered him a post in the Galton Laboratory 
at University College ; but having been already assured of a Chair of Education at Newcastle, 
Thomson eventually decided—to the great disappointment of those of us who were working 
in London—to decline the offer. During the next six years he was mainly occupied in pre- 
paring his books on Instinct, Intelligence and Character and A Modern Philosophy of Educa- 
tion—both based on his courses as Professor of Education. 

Meanwhile, Dr. Andrew Messer, the Chairman of the Education Committee for Nor- 
thumberland (the county whose capital is Newcastle), persuaded the local authority to intro- 
duce a scheme of mental testing similar to that already established in London. Thomson 
was invited to assist in the project, and thus began the series of * Northumberland Tests ， 
in which I was occasionally asked to co-operate. In this way I myself became a frequent 
visitor to his house ; and like all who came into touch with him, profited greatly from his keen 
and stimulating discussions of educational and statistical problems. There are few to whom 
I owe so much. А 

In 1925 he was invited to accept the Directorship of the Teacher Training Colles 2 
Moray House—a post which carried with it the Chair of Education at Edinburgh. Aided by 
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During the first half of the century, Britain was fortunate in having а aumber 
many of international repute, who were interested in the application of statistical 
psychological problems—some were statisticians, like Pearson, Yule, W. F. Sheppare, © 
M. S. Bartlett, others psychologists or educationists like Spearman, Nunn,'Brown, Махи ге 
Garnett, and Thomson himself—most of them, alas, no longer with us.” Nearly all MS 
members of the British Psychological Society, and were able, just before the second E 
persuade the Council to consider publishing a new periodical dealing specifically with ADR 
matical techniques. The war itself deferred its appearance. But in 1947 publication Td 
started with Thomson as one of the editors. Until his health compelled him to erm us 
the work, he proved an invaluable collaborator, severe in the standards which he set peior self 
tireless in helping junior authors, active in seeking publications from overseas. He uiy 
contributed several papers to the earlier volumes, each a model of terse and lucid presentat е 
On retiring from his Chair, һе planned once again to take up the statistical problems ша иге- 
So often engaged his attention. His brilliant little book on The Geometry of Mental Ме зле 
ment, published only last year, is an outline of what he had in view. We hope in a later! ho- 
to print a fuller review of his many lasting contributions to the field of education and po OD 
logy. Here it is only possible to express the grief which teachers, psychologists, educa t his 
ists, and a large company of personal friends both in this country and abroad will feel a 
irreparable loss. Сувш. BURT 
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A STUDY OF THE PERFORMANCE 
OF 2,000 CHILDREN 
ON FOUR VOCABULARY TESTS 


І. GROWTH CURVES AND SEX DIFFERENCES 


By M. I. DUNSDON and J. A. FRASER ROBERTS 
Burden Mental Research Department, Stoke Park Hospital, Bristol 

Il. Results: (a) Basic Data; (b) Linearity 
of the Growth Curves: Increment of Words with CA; (c) Equations Relating CA and 


Words Defined; (d) Sex Differences; (e) Intercorrelations of the Four Vocabularies; 
(f) Comparison with the Previous Sample. 11. The Inclusion of Independent and 
Private Schools. IV.  Absentees. V. Summary. 


I. Introduction: Material and Methods. 


I. INTRODUCTION: MATERIAL AND METHODS 


і and Roberts, 1953) reasons were given why it was 
desired to use vocabulary scales as quickly-given individual tests, not for the assess- 
ment of single children, but for the comparison of groups. For the comparison of 
large groups vocabulary tests have manifest advantages in respect of reliability, 
validity, relative absence of practice effect, and, above all, in the amount of informa- 
tion obtained рег unit of testing time. In order to study in some detail the per- 
ormance of children on vocabulary tests, and to establish norms for comparison, it 
Was decided to apply four scales to a relatively large sample, which it was also hoped 
Would be effectively random. By using four vocabularies the consistency of the 
results can be assessed, alternatives are made available, or; on the other hand, two 
Or more vocabularies can be combined. 


M The four oral definition vocabularies Use 
errill (1937 Revision) Form L, Raven's Mill Hill Vocabulary Tests A and B, and 


that from Wechsler's Intelligence Scale for Children. We are greatly indebted to 
Professor L. M. Terman, Mr. J. C. Raven and Dr. D. Wechsler for permission to 
Use their vocabularies for this piece of work. In the subsequent text, the four 
Vocabularies will be referred to as TML, MHA, MHB and WISC. 

Mention should be made of the way in which the vocabularies were administered 
and scored. In the case of TML and WISC the usual procedures were employed; 
n after tlie usually accepted limit of 5 successive failures, further words were read 
CR child, who was instructed to nod or otherwise indicate if he thought he could 
w empt a definition. The necessity for such a modification of testing procedure 
et to by Kennedy-Fraser in the report on the use of the Terman-Merrill 
m elligence Scale in Scotland (1945). With Mill Hill A and B, both tests were given 
the 3 oral definition form only, and selection of synoriyms was not used. As regards 

imiting point, the same procedure was employed as with TML and WISC. 


In a previous paper (Dunsdon 


d were respectively that of the Terman 
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For all four vocabularies the scoring was 1 point for each word satisfactorily defined. 
With the WISC vocabulary this involved a modification of the usual procedure, in 
which half points may be credited for relatively poorer definitions. The bulk of the 
testing was carried out by one of us (M. I. D.), the remainder by Miss M. E. Drabble 
and Mr. J. Annett. à . 

The plan was to visit all the schools in the City of Bristol, together with some 
others outside the city boundary at which it was expected that Bristol children might 
be found. The children tested were those between the ages (in completed months) 
of 5 years 0 months and 14 years 11 months inclusive, whose birthdays fell on the 
first day of any calendar month. This provides a sample of approximately 3 per cent. 
Children whose homes lay outside the boundaries of the city were excluded. It 
was necessary to limit the survey to the ages at which school attendance is com- 
pulsory, for outside these limits it is practically impossible to avoid incomplete and 
almost inevitably biased samples. | А 

It was found during the course of the Scottish Survey (Scottish Council for 
Research in Education, 1953) that sampling by day of birth in the month is highly 
efficient for obtaining a random sample. There was no significant difference between 
the total sample and the subsample in regard to 5 variables. One important ad- 
vantage of this method of sampling over sampling by date of birth, appears when 
the testing occupies a considerable period, in this instance nearly two years. f 
sampling is by date of birth, the children are growing throughout the period E 
testing; the number of schools is finite, hence considerable bias may be introduce 
by the order in which they are visited. { t 

It was found that 10 per cent. of the boys and 7 per cent. of the girls were va 
private and independent schools; and it is shown later in this paper how greatly t le 
results would have been affected had these children been omitted. Another poss! 
-source of bias is failure to include absentees. Accordingly further visits were man 
to trace and test children who had been absent when the school was first visited. 
Some data on the effect of this are also given later. f the 

Thanks to the generous co-operation of the authorities concerned and О 0 
Headteachers, every school in the administrative area was visited, including those K 
the Education Authority and the independent and private schools. It is perhap 
possible that one or two very small private schools may have been overlooked, 
though we do not think so. The small proportion of mentally defective children 
and children excluded from school was also included. Inevitably there must yis 
some loss of children attending schools outside the boundaries of the city. Ee 
was minimized as far as possible by searching for Bristol children at bem d 
Authority schools outside the area, as well as at a number of independent schoo!» 
both residential and non-residential. f resi- 

The method of sampling is flexible. The limits of age and the place O uS 
dence are fixed in relation to the date on which the school is visited. During ing 
period of testing children are moving into and out of the city. They are melee 
in age, so that CA depends upon the date of the visit; children are also entering ат 
leaving the age-span of the survey. All this is automatically allowed for, the АП. 
representing the average for the city over a period of time and not at а given mo 
The pupils are represented in proportion to their numbers in the population: е 
the unequal numbers at different ages seen in Table I clearly reflect fluctuat 
in the birthrate between 1938 and 1949. [ways 

There are two difficulties, however. The first is that children do not à 


start school immediately after their fifth birthday; hence the youngest age SS 
$'0 to 5:5, is not completely represented. Р‹ rhaps we should have omit and we 


age-group. We were anxious, however, to establish norms down to 5 years, 
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TABLE 1. BASIC DATA 
A. Numbers: Boys, 980; Girls, 967 
B. Sums: CA (in months) and Words 


Boys Girls 
CA. d Я 112,550 110,996 
TML s " 11,747 10,791 
MHA Я А 10,451 9,743 
MHB è a 10,115 9,249 
WISC А " 15,638 14.255 


C. Sums of Squares and Cross Products: CA (in months) and Words ' 


Boys CA TML MHA MHB  WISC 
CA . . | 14,031,414 1,500,497 1,345,985 1,309,351 1,928,554 
TML . x 175,675 155,567 152,242 215,952 
MHA К 141,513 137,042 193,401 
MHB . А 135,555 188,674 
WISC. к 276,614 

Girls S Gh TML MHA MHB WISC 
CA .  .| 13,865,678 1,381,831 1,265,106 1,208,350 1,762,325 
TML . И 149,073 134,603 129,733 182,482 
МНА А 125,531 119,549 166,915 
MHB . ү 116,769 160,138 
WISG . Е 232,961 


` Corresponding figures for combinations of more than опе vocabulary can be obtained (algebraically, 
D. Sums of Words by 6 month Groups of CA 


CA Boys Girls 

Yr. Mth. | xo. | TML MHA MHB WISC| NO. | TML МНА MHB WISC 
S 0-5 43 | 233 154 138 396 | 38 | 179 18 101 — 308 
> 641 55 | 319 236 204 541 80 | 454 285 242 733 
6 0-5 34 | 402 296 283 627 | 58 | 360 255 250 578 
6-11 «9 | 48] 386 374 783 | 56 | 393 321 298 621 

7 0-5 $0 | 395 326 313 623 | 6l | 458 369 344 690 
6-11 © | 52 476 457 794 | 51 | 394 358 327 603 

8 0.5 Ыы | 3551 45 486 830 | 49 | 440 395 391 69 
6-11 б |6 618 578 954 | 4 | 402 371 350 575 

9 05 $9 | eu 51 54 895 | 53 | 532 543 503 802 
6-11 3o | 5% 54 520 808 | 57 | 609 608 548 $65 

10 0.5 $5 | 758 бо 673 989 | 55 | 687 6% 609 0915 
6-11 39 | 6 60 618 900 | 49 | 61 65 588 831 

п 0-5 40 | 905 49] 468 684 | 44 | 567 560 512 755 
6-11 40 | ei 558 335 753 | 33 | 494 470 447 595 

2 05 3 | 617 so 3n 73 | 4 | 677 596 592 802 
6-11 3 | Mo 53 55 74 | 42 | 721 66 642 823 

13 0.5 43 780 684 696 908 44 731 682 669 844 
u 61 33 | Jis єт 62 814 | 31 | 521 475 468 5% 
4 0-5 43 9 719 4 719 940 | 38 | 709 639 631 779 
6-11 40 827 761 pi 95 43 812 749 737 889 
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trust that no appreciable bias has been introduced. The second difficulty is more 
serious. This is the question of absentees on the day of the visit. The absentees 
falling within the sample were duly recorded and visits were made later—a trouble- 
some and time-consuming task from the practical point of view; with some persistent 
absentees as many as six or seven visits were required. There is also the question 
of bias being introduced by the loss of absentees. Actually, despite ae 
pursuit, 27 boys and 45 girls in the older age-groups left school before they coul : Dod 
tested. At the other end of the range, absentees just over 5 years were tested when 
they had grown rather older, and, of course, were not replaced by corresponding 
children who were less than 5 at the time of the original visit. We trust, however, 
that these losses have not had an appreciable effect. In fact, as is shown later, 1n 
our sample the difference in performance between absentees and the remainder has 
proved surprisingly small. m. k , 
Even though, however, the loss of some absentees (a loss which is not Tu Hp 
dent of CA) may not have made any appreciable difference to the final results, 2 : 
improvement in experimental design seems called for should a future survey ls 
planned on the same lines. It might be possible, during the original visits to aa н 
to define samples composed of children present that day, but absent on some ЫШ ilt 
date or dates arbitrarily selected. In this way a sample of absentees could be ve 
up and the main sample supplemented by appropriate addition of figures from 2 
absentee sample. If it proved practicable, this plan would ensure completeness: 
as well as save a great deal of time. hool 
There is one other smallloss. Twenty children, whose names were on pu Б 
registers, could not be traced. They were drawn, in the main, from families ы оо! 
make sudden and spasmodic departures from the area, without notifying the sch 
authorities of their removal or their return. | je: 
To sum up, the losses were relatively small in relation to the size of the Зыр ’ 
and, furthermore, while it cannot be said that they are unbiased, it may be hopia 
for the reasons given, that there has been no appreciable effect on the results Se Sd 
in this paper, and that the 2,000 children do represent a nearly random samP 
the schoolchildren of a large city. 
This paper deals with the first analysis; the question of norms and 
detailed examination of the scores will follow later. 


a more 


IL. RESULTS 


(a) Basic Data. Basic data for the complete samples of boys and girls ү 
tively are given in Table I. The figures given are those required for s more 
made in this paper. The actual bivariate frequency distributions will be f this 
appropriately included, to the extent that space allows, in the second paper 0 
series, which will deal with the establishment of norms. ; uj. Tt is 

(b) The Linearity of the Growth Curves: Increment of Words with ce nance 
shown in the next section that there are substantial differences in Lage ee 
between boys and girls, hence they are treated separately here. For En os is 
purpose 6-month groups of CA are used, as shown in Table I D. The 
shown in Table IT. from 

It turns out that with TML and MHB there is no significant departur ouel 
linearity of increment of mean score in either sex. This is also true of M ie exceeds 
for boys only. With girls the departure from linearity in MHA somewha ificantly 
the 1 per cent. level of significance. With WISC, the growth curve is por point. 
non-linear for both sexes, the variance ratios almost attaining the 1 per cent- 
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Пе Tus feature of the departure from linearity with WISC, as will be seen from 
gt E e a tendency for the rate of increment to fall off with advancing CA. In 
a of the similarity of the results in boys and girls, it may indeed be that this is a 
in ге of the WISC; and in establishing norms for this vocabulary it may prove 
Th visable to rely on the linear regressions. The result for MHA in girls is puzzling. 
i ere is certainly no obvious reason why the sexes should differ in this respect; and 
i spite of the high significance of the departure, we are rather inclined to think that 
y may perhaps be due to chance. It should also be noted that when MHA and 
uo P aie used with oral definition, they are intended to be combined into a single 
io a point we shall not overlook at a later stage when considering norms. It is, 
Hs fo striking that, when all four vocabularies are added together, the depar- 
As s from linearity are much below the 5 per cent. level of significance. Had there 
s re general tendency towards the same kind of curvilinearity, although non- 
n cant in some vocabularies, it should nevertheless have been revealed when they 
ks ге combined. In oral definition tests, therefore, it seems unlikely that curvilin- 

rity is a characteristic feature of increment with age. 
wai vocabulary tests were first introduced, they were intended to estimate total 
M abulary, the choice of words being made, for example, at random from a dic- 
: nary. Tt would be interesting to find, if indeed it is true, that increase in total 

Ocabulary is, on the average, a linear function of age. 

йо (с) Equations Relating CA and Words Defined. From this point onwards we 

Ve used single months as units of CA. The equations derived from the data of 


able I are as follows: 


TML Boys y=— 3-742 + 0:136956х, 
a Giris у= = 3450+ 0:127273x, 
MHA Boys y=— 4476+ 0:131826х, 
» Girls у= – 4-897 + 0:130445х, 
МНВ Воуѕ у= = 5:021 + 0:133595х, 
a Gins у= = 5:403 + 0:130398х, 
WISC Boys ?一 十 2183 + 0:119937х, 
;; ‘Girls у= + r879 + 0-1 12059x, 
Combined vocabularies Boys y = — 11:057 + 0:522314х, 
Girls y = — 11-871 + 0:500174x, 


^» » 


Where y is number of words and x is CA in completed months. 


fo (d) Sex Differences. Sex differences appear consistently in the results from all 
ri VOcabularies. These are illustrated in Figs. 1 and 2, which show the observed 

Cans for complete years of СА, together with the fitted linear regressions. 
06 The difference in mean CA between boys and girls is very small, in fact only 
сед 10115. The regressions obtained by taking single months of CA as in the pre- 
Vag” section, have nevertheless been used to adjust the means of both sexes to age 
En months, The comparison is shown in Table III. It will be seen that the 
i aus at fixed age for boys exceed those for girls in all four vocabularies. The 
m erences, ranging from 0*6 to 12 words at about 94 years, may not appear very 
8e in terms of absolute words; nevertheless all are very highly significant. 1 
sma ning to the regressions, the differences, as shown in Table IV, are much 
With ег. Boys show а more rapid rate of growth with all four vocabularies; but 
5 all except TML the difference 15 not significant : with TML it just attains the 
tiit cent. level. The difference in total increment is non-significant with each of 
Our vocabularies. Thus, the performance of boys exceeds that of girls at all 
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TABLE ПА. TESTS FOR LINEARITY OF INCREMENT OF SCORE WITH CA 
(6 MONTH GROUPS OF CA) 


У Significance 
Variation of Vocabulary D PAGES Qd eid ушш of. 
Score Freedom (words) ° (words) * Deviation 
TML Boys 
Linear regression . 4 1 20,710:3 
Deviations from L.R. Р 18 393-4 21-86 1-52 = 
Within arrays . ; 3 960 13.763:1 14-34 
Total è Р 979 34,866-8 
TML Girls 
Linear regression  . А 1 18,184-8 T 
Deviations from LR. — . 18 218-1 12-12 1-12 
Within arrays . 5 , 947 10,250-6 10-82 
Total * А 966 28,653:5 
| T 
MHA Boys | 
Linear regression . : 1 19,213-4 Si 
Deviations from L.R. к 18 272-7 16:26 1:48 
Within arrays . x А 960 10,554۰4 10-99 —" 
Total А * 979 30.060-5 
МНА Girls 
Linear regression . * 1 19,136:5 deo 
Deviations from L.R. , 18 316:1 17-56 2:10 t 
Within arrays . М Ё 947 7,912:9 8:36 
Total P + 966 27,365-5 
МНВ Воуз 
Linear regression . " i 19,693-2 = 
Deviations from L.R. — . 18 257-8 14:32 1:23 
Within arrays . И г 960 11,202:7 11:67 m 
Total 5 ә 979 | 31,537 
MHB Girls 
Linear regression Я А 1 19,134-6 — 
Deviations from L.R. — 18 190-0 10:56 111 
Within arrays . к í 947 8,981-1 9-48 
Total  . . 966 28.305-7 Se 
WISC Boys | | 
Linear regression . " 1 15,858۰3 + 
Deviations from L.R. — | 18 378-6 21-03 186 
Within arrays . Я 4 960 10,839-3 11-29 j 
Total . | 979 270762 
WISC Girls 
Linear regression . 1 14,086۰3 | 
Deviations from L.R. í 18 3,08-1 1712 | 1-92 
Within arrays . : i 947 8427-0 8-90 ae 
Total. , 966 | 228214 cd 
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TABLE IIB. TESTS FOR LINEARITY OF INCREMENT OF SCORE WITH CA 
(6 MONTH GROUPS OF CA): TOTALS FOR BOYS AND GIRLS 


Variati К Degrees Sum of Mean ; Significanc 
ашарын Vocabulary oF Squares | Square h of x 
оға Freedom | (words)? | (words)? ано Deviation 
TOTAL, All Vocabularies. Boys. 
Linear regression 1 301,166 
eviations from L.R. 18 4.089 227-2 1-33 = 
ithin arrays , 960 163,635 170-5 
Total š 979 468,890 
TOTAL. All Vocabularies. Girls. Н 
Linear regression 1 | 1 
eviations from L.R. 18 2,779 154-4 1-20 = 
ithin arrays 947 121,756 128-6 
Total 966 405,646 


e ages studied; the difference, however, does not widen greatly with advancing 


f It seems clear, therefore, that tests involving oral definition as such definitely 
avour boys; it is not merely a question of a particular vocabulary being sex-biased. 


TABLE Ш. MEAN NUMBER OF WORDS DEFINED AT FIXED CA 


(114:8 MONTHS) 


ii 
А i S.E. of j 
Vocabulary Boys Girls seen cols Di  Dif/s.E. 
TML 11-980 11-161 + 0-819 01610 — 509 
MHA 10:658 — 10-078 十 0.580 0-1419 4-09 
VB 10315 9-567 十 0.748 0-475 5-07 
Isc 15-952. — 14-743 十 1-208 0-1448 834 
Total `. |  4$905 45-549 4- 3:356 0.5547 605 
i TABLE IV. INCREMENT IN WORDS PER YEAR OF CA 
: Difference S.E. of " 
Vocabulary Boys Girls Вуз 2 Girls pim  DIf./S.E. 
ا‎ y 
TML j 
à 1:527 十 0-116 0-0571 2.03 
МНА ES 1-565 十 0-017 0.0504 0.34 
wie 1-603 1565 | +0038 0.023 073 
A 1:439 1345 十 0-094 0-0514 |: 
Total ` 6267 6-002 十 0-265 0-1969 135 


= Personal communication from Mr. Raven is very pertinent at this point. He 
ells us that, in making up MHA and MHB, he was careful to reject any words 
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BOYS g 一 一 
GIRLS O --- 


WORDS 


5 
0 A т 14+ 
5+ 10+ 14+ 5+ 10+ 
YEARS YEARS Я 
A. Vocabulary form Terman-Merrill Scale, B. Vocabulary from Wechsler Intelligence 
Form L. Scale for Children. 


WORDS 


zu 10+ 14+ 
YEARS YEARS B 
C. Mill Hill Vocabulary Scale A. D. Mill Hill Vocabulary Scale Р. 


ions. 
Fic. 1.—Mean Numbers of Woris Defined by Years of CA, with Fitted Linear Regression: 
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100 
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GIRLS Ө === 


80 


60 


WORDS 


40 


20 


10+ + 


+ 
ш YEARS 


ombined. Mean Numter of Words Defined by Years of CA, 


Fic, 2 А 
Фес ularies c [ 
в: тинн with Fitted Linear Regressions. 


slightly more familiar to one or othersex. Never- 
vocabularies is much the same as that shown by 
Terman and Dr. Wechsler inform us, no 


Which he suspected might be even 

less, the sex difference in his 

L and WISC, in which, as Professor 
Such exclusion of words was practised. 


Another sex difference, of a more familiar kind, also appears in our sample, 


it has | variable in performance than are girls; 
ofte d that boys are more | : : 
and this л jn a Ыг way from the present data. The detailed variances 
Е It is hoped to make more detailed use of this 


neo CA are shown in Table V. 
Ormation-in another paper. 
the (е) Intercorrelations of the 1 

Scores on the four vocabularies а 
they The intercorrelations seem reason 
n B refer to tests taking only a few min 

C ability coefficients based on the rep 
Mentioned in the previous paper, seem 


Four Vocabularies. The intercorrelations between 
t fixed CA are shown in Table VI. 

bly high, particularly when it is recalled that 
utes to give. Naturally they are lower than 
etition of the same vocabulary; these, as 
to be generally about ‘9. In a sense, the 
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intercorrelations may be regarded as another kind of reliability coefficient, measuring 
the consistency of performance in the task of word definition generally, apart from 
consistency in regard to a particular set of words. m 
The intercorrelations are appreciably higher with the boys. No doubt this is а 
reflection of the wider dispersion of their performance, as shown in Table V. 


TABLE V. VARIANCE OF WORDS DEFINED AT FIXED CA 


Д Ratio 
Boys Girls Boys/Girls 

TML А s 14-45 10-81 1-34 

МНА х Я 11-09 8-52 1:30 

MHB в + 11-68 9-51 1-23 

WEE , * 11-43 9-01 1-27 

Total . * 171-09 128-67 1:33 


For both boys and girls the intercorrelations show relatively little variation а 
magnitude, except that, as might be anticipated, the correlation between the two form 
of the Mill Hill test (MHA and MHB) gives the highest figures. 


TABLE VI. INTERCORRELATIONS OF THE SCORES ON THE FOUR VOCABULARIES 


Partial Correlations at fixed CA 


Boys MHA MHB WISC 
TML So g -8347 8476 :8233 
MHA :8717 -8314 
MHB -8457 

Girls MHA MHB WISC 
TML А 4 "7775 :8024 7130 
MHA :8317 -8094 
MHB -8234 


(f) Comparison with the 
for TML only, with those 
1953). For the means, a 
the figures are as follows: 


ared, 
Previous Sample. The present results can be сотратч 
obtained from the previous sample (Dunsdon and pes 
djusting to 10-80 years, as used with the previous 5 


Present sample Previous sample 
Boy > 1401. 13-43 
Girls 13-04 12:81 
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The figures for increment of words per year of CA are: 


Present sample Previous sample 
Boys 1-64 1-67 
Girls 1:53 1-55 


The regressions are thus almost identical; but the mean performance of the present 
Sample of children is ‘58 + -28 words higher in boys and -23 + -27 words higher in 
girls. In view of the very different ways in which the samples were made up, it is 
Satisfactory to find that the discrepancies are so small. The sample of 500 test- 
Tesults used in the previous paper was made up from available report-forms for 
Children who had been tested for a wide variety of purposes, these being selected 
(using random processes) so as to give a group with a mean I.Q. of 103 and a standard 
deviation of 17. А large number of the forms, however, were those of brothers and 
Sisters of mental defectives, and few independent school children were included. 
It might well have been anticipated that a highly verbal test, such as vocabulary, 
Would reflect the lower social background, which such a sample might be expected 
to show, more clearly than would the Terman-Merrill Scale as a whole. It is, there- 
Ore, reassuring to find that the difference between the results in the earlier sample 
and those obtained from an effectively random sample is barely significant in the 
boys and not significant in the girls. The sex difference pointed out in the previous 
Paper is now, of course, amply confirmed. 

It may also be noted that the correlation between TML vocabulary score and 
MA on the whole scale, at fixed age, as then discovered, was -84—a figure which is 
of the same order of magnitude as the correlation of the TML vocabulary score 
with the three other oral definition tests, as shown in Table VI. 


111. THE INCLUSION OF INDEPENDENT AND 
PRIVATE SCHOOLS 


In our sample 10 per cent. of the boys and 7 per cent. of the girls attended inde- 
Pendent and private schools. The results for these children separately are not of 
Much general interest. In other cities the proportions and kinds of children attend- 
Ing these schools might well be quite different. Moreover, there are features in the 
Selection from these schools which make the results somewhat unreal. For example, 
Mean age is substantially higher than in the remainder—19 months in the boys and 

months in the girls. This reflects a tendency for children to enter these schools 
at later ages—for example, the very bright children awarded special places as the 
result of the examination at 11+. The regressions therefore are much distorted. 

hat is important is to see what the effect would be if a survey of the present kind 
Were to be restricted to children attending the Local Education Authorities schools. 
t is sufficient to quote the results for one vocabulary only, namely, TML. For 
ys, the figures are as follows: 
Sample omitting 
Whole Sample Independent and 
Private Schools 


Mean score at 114-8 months . = 11:98 11:37 
Increment of words рег узаг . x 64 1-46 
Variance of words at fixed СА. . 1445 10-14 
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The differences are rather less for the girls, but still substantial. It is clear, therefore, 
that the omission of independent and private schools would very seriously alter the 
results, and lead to norms that would be appreciably different from those of the 
whole child population. 


IV. ABSENTEES 


The results for absentees (74 boys, 76 girls) are in sharp contrast to those just 
described. Absentees are lower in performance; but the differences are not great, 
and omission would in fact have made relatively little difference. The results, again 
for TML, are as follows: 


Sample omitting 


Boys Whole Sample Absentees 
Mean words at 114-8 months . . 11-98 11:99 
Increment of words per year . . 164 1-66 
Variance at fixed СА * i . 1445 14-61 
Girls 

Mean words at 114-8 months . . 11-16 11-22 
Increment of words per year . „153 1-55 
Variance at fixed CA `. š . 1081 10:97 


A number of earlier studies have shown that children absent on the date ol 
school visiting are often appreciably lower in score than the remainder; and уе 
were surprised to find relatively modest differences in the present sample. It may 
that the pattern of absenteeism is changing. To mention two possible reason 
this, there are the material benefits now offered by the schools (for example, free 
meals), and there is the fact that more mothers are out at work during the day: 
Nevertheless, one would hesitate to suggest that absentees could safely be omitted 
from a survey of this kind—at least until the relative lack of difference had been Ee 
firmed with other samples in other areas. It may be that our sample of absente", 
proved rather peculiar in this respect; and, until a good deal of confirmatory per 
is forthcoming, there would always be a doubt whether the results had not Ё 0 
fairly seriously distorted. Moreover, even in our sample, the effect of omission a 
absentees, though relatively small, cannot, in the girls at least, be called neglig! 


V. SUMMARY 


а ol 

1. А 3 рег cent. sample of the schoolchildren of the City and County of ва 
was selected, by visiting all schools (including а number outside the city) and сћоо® the 
those children, between the ages of 5-0 to 14-11 years, whose homes were W! d $ 
city and whose birthdays fell on the first day of any calendar month. Four YeVoca- 
lary scales were used: that from the Terman-Merrill Scale, Form L; Mill ies 
ularies A and B; and that from the Wechsler Intelligence Scale for Children- 


2. The present pa Тү Ч : t of norm 
ee dealt with Inter. Paper gives the first results. The establishmen 
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3. All four vocabularies proved to be sex-biased in favour of boys, who on the 
average defined more words at every age studied. The differences were fairly sub- 
Stantial; and all were highly significant. 

4. In 5 out of 8 comparisons given by 2 sexes and 4 vocabularies the increment 
of score with CA showed no significant departure from linearity. Three showed 
Some divergence. There was no significant departure from linearity in the com- 
bined score for all vocabularies. 

5. The intercorrelations of the four scores, reduced to fixed age, were of the 
order of rather more than ‘8; they showed little variation in magnitude. 

6. It is shown that omission of children attending private and independent 
Schools would have distorted the results very seriously: on the other hand, absentees 
on the dates of the original visits to the school were not greatly lower in performance 
than the remainder, and their omission, in this particular sample, would not have 
altered the results very appreciably. It is not suggested, however, that this would be 
1n general a safe procedure. 
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AN ADDITIVE METRIC FROM ALL 
THE PRINCIPAL COMPONENTS OF 
A PERFECT SCALE" 


By LOUIS GUTTMAN 
Israel Institute of Applied Social Research 


Abstract. А new type of metric is established for perfect scales that is different from 
the three Previously known types of metrics. Given a perfect scale of m types of dicho- 
lomous items, each rank of persons is regarded as a point in an m-dimensional space defined 

Y the m non-constant principal components. A non-Euclidean distance function is defined 
for this Space. It is proved that the resulting metric is additive : the non-Euclidean distance 

etween any two ranks i and £ is the sum of the distances from i to j and from j to k when- 
ever i = j 5 k. Treating the principal component space as non-Euclidean may also be 
useful for the study of non-scale structures and of quasi-scales. 


1. Introduction. 11. A Numerical Example of Principal Component Scores. III. Making 

each Principal Component’s Variance Proportional to its 12. IV. The Non-Euclidean Distance 

unction and the New Additive Metric. V. Possible Applications for Non-Scalable Universes 
and for Quasi-Scales. VI. Proof of the Additive Property of the New Metric. 


I. INTRODUCTION 


РЯ Three different meaningful metrics are known for a perfect scale, and have been de- 
Cribed elsewhere [2, 3]. They are, respectively: (a) the basic rank order of the population ; 
et first principal component scores, which maintain the basic rank order, but stretch 
def Contract differences in rank to give а best metric in a certain sense of least squares (as 
S ned in [1]; (c) ‘absolute psychological types,’ or the intervals along the basic rank 
rder, defined by the bending points of the successive principal component curves. 
ty In the present paper we shall prove that every perfect scale always has also a fourth 
are Of metric, and that this is an additive metric. The new metric is computed in an exact 
Б &ebraic fashion from all the principal components of the perfect scale. In this new metric, 
to^ and К are any three scale ranks, and if j is between i and k, then the distance from i 
15 the sum of the distances from i to j and from j to k. А 
а € shall first illustrate the new metric and its additive property by a numerical example, 
Pa Shall give it an interpretation. Then we shall prove algebraically that the additive 
Toperty holds for all perfect scales. ; 


П. A NUMERICAL EXAMPLE OF PRINCIPAL 
COMPONENT SCORES 


For illustrative purposes, let us take the example published in [2] of a perfect scale of 
nu Уреѕ of dichotomous items, where all the principal components have been worked out 
merically。 Five types of dichotomous items yield six ranks (types) for persons and five 


1 А А " H A 
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principal component scores for each rank, apart from a constant score (on the constant 
principal component). Given that the types of items are all equally frequent in the universe 
of items, and that the types of persons are all equally frequent in the population of individuals 


(respondents), the principal component scores of the scale turn out to be numerically as in 
Table I. 


TABLE I. RAW PRINCIPAL COMPONENT SCORES FOR EACH RANK OF 


PERSONS?! 
Principal Component 
Rank of Persons 

I I1 ш IV У 

5 5 5 5 1 1 

4 3 =I =71 一 3 一 5 

3 1 —4 —4 2 10 

2 =i —4 4 2 —10 

1 -3 —1 7 —3 5 

0 =S 5 一 各 1 —1 

n? `6 2 1 ۰06 ۰04 

Raw sum of squares 70 84 180 28 252 


1 From [2], Table IV, p. 322. 


We have copied the principal components’ scores in Table I in the same raw form as 
given in [2, p. 322]. These are raw scores in the following sense: each principal componen 
is a latent vector of a certain matrix. Latent vectors are defined only up to a constan d 
proportionality, that is, multiplying any column of scores in Table I through by any (non-zero 
constant will yield new scores on what is actually the same principal component; geometr! 
ally, the direction of the latent vector remains unchanged—except possibly for Sign 一 ane 
only its length is affected. The length of any given latent vector in Table I is the squat 
root of the corresponding sum of squares shown in the last row of the table. 


II. MAKING EACH PRINCIPAL COMPONENT'S 
VARIANCE PROPORTIONAL TO ITS 7° 


For purposes of exposition in [2] the raw scores in Table I were deliberately compute? 
to be in exact whole numbers. This has made the raw sums of squares—and hence с 
lengths of the corresponding latent vectors—arbitrarily unequal. For some theoreti is 
purposes, it is useful to make the lengths of all latent vectors equal, say equal to 1; the 
can be done for the data of Table I by dividing each raw score in a given column by ew 
Square root of its raw sum of squares. For each column the sum of squares of the’ 
scores will then always be 1. ify the 

For our present purpose, as a first step toward our new metric, we need to modify ^^ 
lengths of the latent vectors in a different manner, namely, to make each new sum of s4 ~e 
equal to the corresponding 2°. One motive for this is as follows. The components 
arrived at in the first place from a maximization problem: given the responses of à POP in- 
tion of individuals to a universe of items, find that set of scores—or metric—for ШШ res 
dividuals which will maximize the correlation ratio over items [1]. The maximizing S°% sr 
turn out to be the first principal component of the set of responses, and so give th? үү 
metric in this sense of maximization. But the same equations show that there also they 
next best down to worst principal components, cs ranked by the correlation ratios the 
Yield. The sum of squares of all correlation ratios is 1 [cf. 2, p. 322], showing t 
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total variance of the responses can be accounted for exactly and in a linear fashion by all 
the principal components. (The linear equations for reproducing the original item re- 
Sponses exactly from all the principal component scores are illustrated in [2, pp. 331 f.].) 
In this sense (and since the principal components for a perfect scale are always orthogonal 
to each other), the 7° associated with a given principal component expresses its proportional 
contribution to the total variance of the scale. 

Therefore, in making the length of each principal component vector equal to the 7 of 
that component, we are expressing each component in a way which reflects its importance 
for accounting for the observed data in a linear fashion. Тһе first component is given the 
largest variance, the last component the smallest variance, and intermediate components 
variances of intermediate size. 

To convert the raw scores of Table I into the proportional form that we desire, it is 
necessary simply to divide each row by the square root of its raw sum of squares and then 
multiply by 7. This can be done in one step by multiplying each row by the quantity 
v (n*/(raw sum of squares). Thus, each raw score іп the first column of Table I is to be 
multiplied by /{-6/70} = ‘09258. For the remaining columns the multipliers аге /{:2/84), 
V 01/180), 4/(06/28), and v {04/252} respectively. The new scores are shown in Table II. 

Here we are omitting the constant component throughout, because it does not affect 
Our new metric. The correlation ratio is actually zero for the constant component, although 
its latent root is 1; for each of the other principal components, 7° is always equal to the 


Corresponding latent root. 


TABLE П. PRINCIPAL COMPONENT SCORES! 
Expressed with Sum of Squares equal to 7* 


Principal Component 
Rank of Persons | 一 P 
I II III IV т 
5 4629 2440 1179 -0463 :0126 
4 27711 | —:0488 —160 | —-1389 — 0630 
3 0926 —1952 —:0943 -0926 :1260 
2 —:0926 —:1952 +0943 -0926 —:1260 
1 —:2771 —:0488 -1650 一 "1389 “0630 
0 —:4629 2440 1179 0463 —:0126 
Mm 6 2 1 06 04 
Sum of squares 6 2 1 ۰06 ۰04 


! Computed from Table I by multiplying each column of raw scores of Table I by 
V/(r*/(raw sum of squares of that column)). 


IV. THE NON-EUCLIDEAN DISTANCE FUNCTION 
AND THE NEW ADDITIVE METRIC 


(n We are now in a position to introduce our new overall metric. In Table II, each rank 

OW) has five Component scores. Since each pair of components is orthogonal over all 

Persons, the five components can be regarded as providing an orthogonal coordinate system 

Point iy (eimensional Euclidean space, and each rank of persons can be thought of as a 
IS five-space. ] В 

Th t us now Rm about the distance between each pair of ranks in this five-space. 

€ Euclidean definition of distance between two points requires us to take the square root 
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of the sum of squares of the differences between the corresponding coordinates of the two 
points. Thus, in the five-space of Table II, the Euclidean distance between ranks 5 and 4 
is \ C24), since 
(4629 — 2771)? + (:2440 + 0488): + (1179 +-1650)? + (-0463 :--1389)? + (0126 + -0630)° = 24. 
For the purposes of our new metric, it turns out to be more profitable to regard the 
five-space of Table II as a non-Euclidean space, by defining the distance between two points 
to be the square of the Euclidean distance: that is, we shall dispense with taking square 
roots as required by the Euclidean metric. Accordingly, our non-Euclidean distance 
between ranks 5 and 4 is -24, in distinction to the Euclidean distance of ү (24). The non- 
Euclidean distances between each pair of ranks of Table II are given in Table III. 


TABLE HI. THE NON-EUCLIDEAN DISTANCES 


The Euclidean-Distances-Squared between each Pair of Ranks in the Five-Space defined by the 
Principal Components in Table II 


Rank of Persons 


5 0 24 39 зз 67 м 
4 | -24 0 15 — 38 43 67 
3 | .39 15 0 3 28 .32 
2 .52 25 43 0 ds 539 
| 67 43 28 — 45 0 24 
0 D -67 52 39 м4 0 


A striking feature of Table III is immediately apparent. If i < j < k, then the distance 
from rank К to rank i is greater than the distance from rank k to rank j. Thus, the distance 
between rank 4 and rank 0 is greater than the distance between rank 4 and rank 1 (67 as 
compared with -43). But the distances between adjacent ranks are not necessarily equal; 
oe rank 5 is farther from rank 4 than rank 4 is from rank 3 (-24 as compare with 
,. The properties described in the preceding paragraph would hold also were the Euclidean 
distance function (or any other monotonely related distance function) to be used. Е 
particular non-Euclidean distance function has the following distinctive feature: the distance 
between any two ranks i and К is precisely the sum of the distances from i to j and from 
to k, where j is any rank intermediate to i and k. For example, rank 3 is intermediate p 
ranks 4 and 1; the distance from rank 4 to rank 3 is -15, while from rank 3 to rank 1 e 
28. We have ‘15 + -28 = -43; and -43 is also the distance between ranks 4 and 1 as СО 
puted independently from Table II and recorded in Table III. 


24 IS G «5 24 


— ——À—Ó—À—— 


1 | 
| € 38 4 —$ 
Rank of persons Any 
Fic. 1. —The Ranks of Persons Plotted on a Straight Line with Intervals as Defined by the non 
Euclidean Distance Function of Table III. 


This additive property of our non-Euclidean distance function implies that We the 
mark off our ranks of persons as points along a straight line, the distances betwee? 
points being proportional to the values given by this distance function, as shown !n Kwo 
All the entries in Table III can be reproduced from, Fig. 1. The distance between any 
ranks is the sum of the lengths of all intervals between them in Fig. 1. 
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The new metric portrayed in Fig. 1 maintains the original scale rank order of the types 
of persons, just as do the previously known three other kinds of metric. It gives distances 
between the ranks which are different than from those given by the metric that is best in 
the sense of least squares (the first principal component). The least-squares metric is 
derived by obtaining projections of the ranks on a best single continuum. Our new metric 
looks at all the ranks as in a complete multi-dimensional space, and does not take projections 
Оп only one of the components of that space. We now see that, if we measure distances 
in a certain non-Euclidean manner, the scale ranks all lie on a * straight line " in the resulting 
non-Euclidean space. 


V. POSSIBLE APPLICATIONS FOR NON-SCALABLE 
UNIVERSES AND FOR QUASI-SCALES 


It is not clear at present what further implications the new metric has for the theory of 
perfect scales as such, or for the psychological interpretations of Scales (as from the metric 
of absolute psychological types discussed in [3]. t may have important applications in 
the analysis of zon-scalable data. "e 

The general equations for the principal components of qualitative data hold whether 
the data form a perfect scale or not, and indeed were derived in [1] before the theory of 
Perfect scales of [2] was developed. Any set of qualitative items can be regarded as estab- 
lishing an n-dimensional coordinate system of principal components for any population of 
Tespondents; and each person can be regarded as a point in this space. Tables analogous 
to Tables IT and III above can be computed. Inspecting the distances between the persons 
San sometimes show that the points lie on some simple surface, if not on a straight line as 
In the case of a scale. This would give a means of analysing simple non-scale structures. 

t may turn out that, for some non-scale structures, a different non-Euclidean distance 
function may be more appropriate than the present function. The new type of metric may 
also prove helpful in studying quasi-scales. A problem to be explored is the effect on the 
New metric of errors of reproducibility. It may be that the new metric actually defines a 
Class of quasi-scales, for it may possibly be perfectly additive for data which do not form a 
Perfect scale, according to the mathematical analysis below. 


VL PROOF OF THE ADDITIVE PROPERTY OF THE NEW 
METRIC 


The numerical example worked out above was for a special case of five types of items, 
and where uniform frequencies (rectangular distribution) were assumed both for the types 
Of persons and the types of items. Different frequencies would yield different principal 
COmponent scores from those of our example. We shall now show that the additive property 
holds for any frequency distribution, and for any number of items. While our analysis is 

he same as for dichotomous items, this entails no real loss of generality; for any item can 
© reduced to dichotomous form, or expressed as a set of dichotomies. > 

We shall use the results of [2] and essentially the same notation. As in [2], we may 
Consider a perfect scale of m types of dichotomous items, yielding m + 1 types of persons 
ranked from 0 to m, and which is resolved into a set of m + 1 principal components. Let 
! denote the relative frequency of the ith rank of persons (i = 0, 1, . . . , m), and let хш 

Enote the score of the ith rank on the ath principal component (a, i = 0, 1, .. ， m). 
ы Еог Convenience, the numbering of the principal components will be regarded as being 
in ODposite order of the size of the corresponding latent roots. In particular, а = 0 denotes 
the Constant principal component (with latent root 1). Since the component scores are 
determined only up to a constant of proportionality, we can normalise them so that 


(@=0,1,...,m. (р 
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In [2] it has been proved that ail latent roots are distinct, so that any two different principal 
component score vectors are necessarily orthogonal to each other with weight function fi- 
Given also (1), the m + 1 components yield an orthonormal set, or 


m 
> Лхаіхы = Sab (qb = 0,1, 0 a 08), (2) 
i-0 


where Sab is Kronecker's delta (equals 1 if a = b, and 0 if a + b). 
Equations (2) imply that the square matrix of order m + 1, || /(fi)xai ||, is an orthogonal 
matrix. This matrix must then be orthonormal by rows as well as by columns, i.e. 


fi 5 XaiXaj = 8 (LJ = 0) bas ss Е (3) 
а0 


Let 74? be the correlation ratio squared of the ath principal component. Then 7], 一 0; 
for the constant component has no variation, although the latent root for this component 1S 
1. Our new metric, as illustrated in sect. IV above, defines the non-Euclidean distance 
between two ranks i and j to be d(i, j), where, recalling we already have normalised the Xa 
according to (1), 


«i, j) = > 7] (Xaj 一 Xai)? re Lo sati) (4) 


This metric will be additive if, and only if, for any three ranks i, j, and k (where i = j = k) 
the following property holds: 


dli, k) = dli, j) + d(j, k) (isjsk. ©) 
Now, the following identity is obvious: 
Xak 一 Xai = (Xak = Xaj) + (Xaj = Xai). (6) 


We can square both members, multiply through by 74, sum over а, and use (4) to obtain 
the further identity 


dli, k) = dli, j) + dj, k) +2 > К а д 
(Lj kom0,1...,m. ©) 


Clearly (7) will reduce to (5) if, and only if, the last member of (7) vanishes when- 
ever ix js К, or 


m 
> "lé Qi — xa) — Xai) = 0 sjak (G9 
< 
Thus our task will be ended if we prove that (8) always holds for a perfect scale. . Axat Be 
As a preliminary to the proof, we establish a sufficient condition for (8). Let ха 
defined as the difference 
Axai = Xa, i+1 — Xai G@=0,1,...,m—1). (9) 
ê shall 


Differencing will always be with respect to the second subscript, so instead of 4, wi 
write A throughout. If > i, then we can write, using (9), 


Жее 
Хај — Xai— У Аха, ins g>). ao 
s=0 
Similarly, if k > j, we can write 
k-j-1 Р 
Xak — ха = У AXaja 5 kop. a 
1=0 
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Using (11) and (10) in (8), we can write 
n" 1-1-1 k-j-ly m 
È (хак — Xaj)(Xaj — ха) = 5 È (= талха, Лха, st) 
a=l s=0 t=0 \a=1 
(i<j<k. (12) 
Let pij be defined by: 


m 


py = Y xaxa CJS essa OS 
а=1 


Clearly, a sufficient condition for the righthand member of (12) to vanish is that pis, ;.., = 0 
for all s and z in the given ranges for the respective i, j, апа k. Surely, then, a sufficient 
condition for the right member of (12) always to vanish is that: 


ру = 0 (=; 1) = 0, 1,.... т 1). (14) 

If we prove (14), we shall have proved (8) via (12), and hence (5) via (7). The left member 
of (8) is identically zero if i = j or j = К, and hence we need consider the case only where 
1 <j < К, as we shall now do via (14). | | 

,. ,For the proof of (14), we shall use the difference equations of the perfect scale derived 
in [2]. These equations can be written in the following form, suitable to our present pur- 


Poses [cf. 2, p. 345, equations (51)-(53)]: 


Ae = o ees (а= 1,2,...,п) (15) 
Na 
1 i120,1,...,m—2 
Ci+1AXa, i41 — ci Nai = 一 DELL ist (= l9 E en (16) 
TT UMEN m (а= 1,2,..., т). (17) 


Na? 

We have modified the notation slightly from that in [2]. Our present c; are n/N times 
those of [2, equation (34)], and our 1/74? are n/N times the ¢ of [2, equation (35)], except 
for when a = 0. It is essential to note that 1/7 is nor defined in our present notation, 
because 170° = 0, although фо is defined as ф = 0 in [2, рр. 345 f.]. Our present equations 
(15) through (17) are not stated for the constant component, to which we have to give 
Special attention in the algebra that follows. 


Let x, be the common constant value of the ы (1 == 0, 1,5 5 
m 


(1) above, Xo is fixed by x" SA = |. Explicitly, if > f; = N as in the notation of [2, equation 
= 120 Е 
(9), then Xo = V(I/N). "From (3), if we take the case a = 0 out of the summation and 


Write it separately, we can state for i + j that 


. m). Under condition 


ЛЎ хаха = — fort ел ho 0. к 55m). (18) 
acl 
Taking the first difference of both members of (18) with respect to j, and dividing by fi, 
Shows that 
m xd у $=0) Lo sg? ) 
> хаћхај = 0 (i+ j t1s 70, e ЛИХ (19) 


=1 
= "Axaj, sum over а, and 
‘Ow, if we multiply both members of (15) through by 7esAxah， $ use 
Notation (13), and por С (19), we see that, since no c; can vanish, 
G=1,2,...,m—1). (20) 


ру = 0 А | 
Multiplying both members of (16) through by 7Ja’Axaj and summing over a will similarly 
that 
B= 0), aa nig I — BD 
ol or . 2 
Сері, — сіру = 0 (i # jj +1 . 450,1, ag 0 小 (20 
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Stating (21) for i = О, and using (20), now shows that 


ру = 0 (j 292,3, = ад T). (22) 
Stating (21) for i = 1, and using (22), etc., shows that in general 
=“. Ba asap 2) 
ines yt ‚1, > . Q3 
ше (>i j-0,1,...,m—1 


But ру = pji from (13). Hence (23) is equivalent to (14), and we have completed the proof 
of the additive property (5) of our new metric. : 

It may be remarked that property (14) shows that, from (13), || aMxai || is a square 
matrix of order m that is orthogonal by columns. The Axa; are closely related to the principal 
components of the categories of the items in the scale, as shown by [2, equation (30)]; the 
weight function required for orthogonalization by rows is correspondingly implied by 
[2, equation (94)]. 

A full treatment of the related problem of category components, when items are not 
dichotomous but quantitative variables, is given in [4]. 
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THE CORRELATION BETWEEN VARIATE- 
VALUES AND RANKS IN SAMPLES FROM 
DISTRIBUTIONS HAVING NO VARIANCE 


By ALAN STUART 
Division of Research Techniques, London School of Economics 


Fix 1. In [2], it was shown that in samples of size from a continuous distribution function 
(X) with mean ш and variance o°, the correlation between variate-values and the corre- 


Sponding ranks is given by | 
_ Л Du xF(x) dF(x) — $n 
6-1 Ems 5] 4] - 8 (1) 


We now consider the problem in the case when the distribution has no variance. 

. „ Moments can fail to exist only if the range of the distribution is either singly- or doubly- 
infinite, In the first place, we consider only a portion of the distribution defined on an 
arbitrarily wide range (a, b). For this interval, (1) holds, with a and b as the limits of 
Integration in VxFdF and in the integrals defining p and o°. 

b 2. Consider first the case of a singly-infinite range, say upwards. We may take a to 
(b the lower terminal of the distribution, and take the limit of (1) as b— со. If we write 

as 
Cn = K.R, (2) 


We have 
b 


[[xrar- à | х а} 


Lim К° = Lim -; E (3) 


Ü x2 dF — (| x a] 


a 


^ the mean of the distribution exists, the numerator of (3) has a finite limit; for | xF df con- 
erges whenever VxdF does, since 0 < F< 1. And since the denominator of (3) diverges 


Y hypothesis, the ratio Ra tends to zero. | 
The problem becomes non-trivial when the mean, as well as the variance, does not 


d We then apply L'Hópital's rule to (3), differentiating numerator and denominator to 
ain 
b b 
AFO) — {| xF AF — 4 [а 
Lim R? = Lim => Е , 
fo = 2| x ағ} 

апа о | | a 

n repeating the operation, " л 

fo) [scr — » + {[ rar -3| xar} ] 
e a a 
Lim R? = Lim аы 1 — 2bf(b) . (4) 


The Correlation between Variate-Values 


Now if there exists a value of x, say X, such that 
FOO forall x>X, 


i.e., such that the frequency function is monotone, decreasing beyond some point, it will 
follow from a theorem quoted in [1], p. 124 that, even though the mean does not exist, 


Lim bf(b) = 0. (5 
Substituting (5) in (4), we find 


z " 3 и: (6) 
Lim А Lim WHO) 
Applying L'Hópital's rule again to (6), we obtain 
A cua d 3 
Lim. R = Lim ZÊ d uu n 
and since 
Lim F(b) = 1, 
(7) becomes 
3 
Lim R? = — Lim UO, 
ау Uy (8) 
= — Lim БД). f 
From (5), using L'Hópital's rule again, 
. b ， {f(b)}* (9) 
0 — Lim URB Lim F0) 
Putting (5) and (9) into (8), we have, using (2). 
Lim C, = Lim R? = 0. (10) 
An example of the type of distribution discussed is 
ано) ос E, (0< х < oo) 
К 3 
dx a 
Which has no mean since | ES diverges. 
0 
7 t 
3. The same result holds for a doubly-infinite range if we write a = — b and interpre 
the integrals as principal values. For Since 
b 
d d 
ab | g(x) dx = 2% | g(x) ах, 
Sb A 


; es n 
the effect of having a doubly infinite range is simply to introduce a factor 2 whenever fo 


integral is differentiated, and this does not change the value of the limit at (10). Thus, ® 
example, the Cauchy distribution 


Р ах 
ills dod =, 


* de 
Which has no moments and its monotone in is tails, has C, = 0 by (10). We O de 
that, for any continuous distribution with no moments which is ultimately mono 
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the correlation between variate-values and ranks is zero. However, this property does not 
characterize such distributions, for in [2] it was shown that for Gamma variates with para- 


meter m (> 0), 

lim C, = 0, 

т->0 
and that generally, when the variance exists, a necessary condition for C, = 0 is that Дх) 
be unbounded at some point within its range. > 
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ERRATA 


„The following corrections should be made in the article on * The Correlation between 
Variate-values and Ranks in Samples from a Continuous Distribution,’ this Journal, VII, 


Pt. I, pp. 37-44. 


- P. 40, eqn. (22): Right side should be 1/(2.33), not 1/(2.3)!. 

- P. 41, below eqn. (24): delete * a result . . . much earlier." к 

- P. 42, end of Section V should continue: * The result (24) for ғ іп the normal case has 
been given by Burt (1953), who obtained it earlier." 

- P. 43, Example 1, Line 5: delete * іп virtue of section У.’ 

- P. 43, Example 1, Lines 6 and 7, delete: * asymptotically ' (twice). —— у 

- P. 43, Example 1, second paragraph, Line 2: The result for asymptotic relative efficiency 


Should read (С?)# = (3/7)*, not С? = (3/7) as printed. 
ALAN STUART. 
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A NOTE ON THE USE OF BURT'S FORMULA 
FOR ESTIMATING FACTOR SIGNIFICANCE 


By EDWARD E. CURETON 


Burt ! gives as an approximate formula for the standard error of a centroid or simple-summation 
factor loading : 


са = (0—a)v(QliNn—s DN . . . . D 


where a is the loading, N is the number of subjects in the sample, п is the number of tests or variables, 
and s is the ordinal number of the factor. ú 
Vernon suggests that the sth factor may be considered insignificant whenever at least half its 
loadings fall short numerically of twice their standard errors.? It is not necessary to compute the 
standard errors of all п factor loadings to apply this test. If we examine (1), we see that the expression 
under the radical is the same for all loadings on a given factor. 
Let a/aq = k (according to Vernon's recommendation, k should be2); then a = koa, and from (0, 


а= (1—4) С, Bos $ 4 4 ow 
where С = kv(nNn—s D. . . . s . 

Now (2) is a quadratic in a, whose positive root is 
a = [-14- V/(1--4C2)]2C. e 


: r 
If we find C from (3) and then a’ from (4), a’ is the critical value of a: all loadings numerically gre 
than a’ are significant and all loadings numerically less than a' are insignificant at the ka le 


EXAMPLE: 


N = 600, n = 30, s = 11, k = 2. 
С = 2/(30/[600 (30—11--1)]] = +1. 
а" = [—1+ /(1-04)]/-2 = -099. 


The eleventh factor will be significant, according to Vernon’s standard, if and only if more th 
its loadings are numerically greater than -099, 


an half 


~ igni canc? 
„_ 1 C. Burt and C. Banks. Annals of Eugenics, XIII, 255; also this Journal, * Tests of signifi 
in Factor Analysis’, v, 121, eqn. xxvi. 
2 Burt’s criterion is more lenient : 
the standard error for the factor to be si 
into account. 


ice 

Р d twit 
with borderline cases at least one quarter must he loading 
ignificant. But in general he also takes the size О 
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A PSYCHOLOGICAL STUDY OF 
TYPOGRAPHY 


By CYRIL BURT, W. F. COOPER, and J. L. MARTIN 


University College, London 


Abstract. А wide variety of type-faces have now become available for books 
and journals. It therefore seemed desirable to investigate (4) the legibility and 
(B) the aesthetic merits of those in more frequent use. 


. . (4) Using tests of speed and comprehension, we have studied the 
influence of type-face, boldness, size, interlinear spacing, length of line, and 
width of margin on legibility both with children and with adults. The results 
have furnished provisional norms for children's reading books and for scientific 
journals such as the present. 

(B) Factorial methods, supplemented by an analysis of introspections, 
appear to yield a classification of both readers and type faces based on aesthetic 
preference ; and the data incidentally obtained throw considerable light on the 
reasons for such preferences. 


l. PREVIOUS INVESTIGATIONS OF TYPOGRAPHICAL PROBLEMS 


inte tating as a Medium of Communication. Psychologists have of late shown increasing 
pee est in what is called * the theory of telecommunication ， that is, communication between 
E who are not within visible or auditory range of each other! The statistical treat- 
evel of the issues involved, which is the most distinctive feature of the * theory’, has been 
Penso ss primarily in connection with telephonic engineering ; and, perhaps largely for this 
of tr. n, the attention of the psychologist has been concentrated mainly on auditory methods 
d cansmission. .But communication by visible symbols is not only the oldest but by far 
fse Опе. method of imparting information to a distant recipient ; and the problems 
are ting up print and reading print—the most thoroughly standardized of all such devices— 

Squatly amenable to the same mode of analysis. е 
пй f we take the * phoneme ' 2 as the unit of information, the English language presents us with a 
Brod К defined ensemble of alternative elements—12 vowels and 37 consonants—which the printer 
form eds to encode by using an alphabet of 26 visible signals, each of which may have four different 
which woman, italic, upper case (or capital), and lower case. , Analogous to the audible * noise’ 
interfere e feres with the accurate transmission of the audible information, there is a visuo-mental 
just rence or * blur’, which impedes the accurate transmission of visible information. Accordingly, 
as the intelligibility of a wireless message is measured by a logarithmic function of the signal-to- 


Phil Wa Burt, this Journal, IV, pp. 195-200 : also D. MacKay, * Quantal Aspects of Scientific Information’, 
and C ag., LXII (1950), С. A. Miller, Language and Communication (New York, McGraw-Hill, 1951), 
1953) - E. Shannon, * Communication Theory : Exposition of Fundamentals (London, Ministry of Supply, 
Phone Otto Jespersen, Lehrbuch der Phonetik (1916), Daniel Jones, Outline of English Phonetics (1928). Any 
alwa 1С symbol, e (for example), represents, not an invariable speech-sound made with the vocal organs 
to the in the same place, but a family of e-sounds varying slightly (even with the same speaker) according 
trouble sal context. Such a family is termed a" phoneme `- The neglect of this fact is a common source of 
ррепіп teaching children, particularly backward children, to read: see Burt, The Backward Child, 
American, IV and refs. It may be noted thac three more phonemes are needed to represent standard 
speech, 
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noise ratio, so the intelligibility of a printed message could be measured by a similar function of 
what we might call the symbol-to-blur ratio. 

As was perhaps only natural for the engineer, the formulae which he has developed to measure 
information treat the efficacy of the signalled message as a kind of * energy ' or capacity for doing 
work, and its negative is represented by an expression similar to that adopted by Boltzmann in formu- 
lating * entropy ° in terms of probability. However, for our present purpose it seems desirable to 
take account of yet another variable characteristic. Of two media enabling the same message 
to be communicated to the same recipient, with the same accuracy, that which delivers the information 
the more speedily must plainly be regarded as the more effective. The consideration of speed of 
reading therefore adds a further complication to the problem, and should perhaps lead us to look 
for some formula which will enable us to measure the * power ° of the media. > 

. We believe that the adoption of the concepts and methods employed in the theory of communica- 
tion would greatly increase the logical rigour of the psychologist's investigations. But we are some- 
what doubtful whether, in the existing stage of knowledge, the use of the corresponding algebraic 
formulae will be of much service. Psychological measurements are so crude and inexact that the 
elaborate mathematical refinements of these newer techniques seem out of place. In any case, the 
following article is intended merely as a preliminary exposition; and it has been our aim to make it 
comprehensible to teachers and others with no special knowledge of advanced statistical theory. 
For the most part, therefore, we shall for the present keep to the simplest possible methods Oase 
ment; and measurements of accuracy and speed will be reported separately in terms of units of шше 
and of ideational items. We shall begin by investigating the ways in which the visible appearan 
of printed matter aids or hinders the communication of the ideas which it seeks to express, in a WOTC: 
the conditions affecting legibility. 


II. LEGIBILITY 


The Legibility of Children's Reading Books. In this country the typographical questions 
which have chiefly attracted the attention of the psychologist relate to the sizes and shapa 
of the letters used for printing children’s reading books. Even on these points the number n a 
British investigations has been surprisingly few ; and those who are concerned with hos 
selection of suitable type for reading books or for verbal tests are still guided (so far as t rly 
pay any consideration to the matter at all) by recommendations put forward nearly fo 
years ago. и ир 
в In 1912 a committee was appointed by the Education Section of the British Association b 
investigate * the influence of school books upon eyesight’. In order to secure evidence on certain 
the questions then raised, Burt (at that time Psychologist in the Education Department of the Lon er 
County Council) and James Kerr (then Medical Officer in the same Department) carried out а num i 
of investigations among children in London schools. It was agreed that Dr. Kerr should deal Dt 
theclinicalaspects, while Burt undertook a series of experiments in the classroom and in the laborato 
es EO printer generously supplied us with a wide variety of test-sheets printed to OUT 9 

ions. 


The results of our investigations were summarised in the L.C.C. Annual Reports and Jet as ， 
Dr. Kerr’s larger treatise [7].2 The Final Report of the Committee of the British Association [ style 
published in 1917 ; and included a tabulated set of standards showing the size of type and t b the 
of printing suited to pupils of different ages. Most of the school books since published bY 


ing 
more competent firms have conformed fairly closely with the standards then laid down.? pur й 
wo! 


! One of our chief methodological difficulties has been to decide whether to take the letter or ШОУ of 
(and their respective significations) as our unit. In applying the theory of communication to the 5, sup. 
the child's speaking vocabulary, Burt preferred to take the word as unit (cf. this Journal, loc. Cainjy to 

ivl mr ‚ we nen the present series of experimen kept етс ity 
ide: 1 sychologists who have i i ibility have lim ibili 
to the legibility of isolated letters. Put with modern metoda or ТЕА to read, the legib jes 
of the word as a whole is even more i 


The two ri oth: 
the same result. i igati equires the use uite 
2 Cf. also Burt [3], pp. 271, 340-1. Dr. up result, A. complete investigation regu space are not dies 
so generous as those of the Brit. Assoc. сше; A review of practices generally сштеп befoel Б 
. G. F. i i i County wi 
the Annual Report of the Brit. Assoc. for 1911, p. CORPER E E tive топовтар? (4) 
See also R. D. Morss, * Т (L. A. Legros, Note on the Legibility of Printed Matter, 1922 ; C! у 
3A fairly co Eny he Neglected School Book ', Monotype Recorder, XXXIV, 1935, рр: 2j спар: lit 
(pp. 101-25 У Comprehensive summary of the chief American investigations will be found in [2 le origin? 
рр. ; See also bibliography, pp. 452-74). But the reader who has по easy access to t m state e 
to find that the age of the subjects of the experiments is sel odes 0 
to yield significant differences between two or more differen 
aet in what direction the differences pointed. 
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the next ten years a series of laboratory researches, often involving highly skilled techniques, were 
Carried out in America [9-14]. But since the Second World War little fresh work has apparently 
been attempted in either country. Owing to the great advances in printing during the last two or 
three decades, and the wide variety of types meanwhile introduced by the Monotype Corporation 
and other foundries, it seems highly desirable that the problems should be taken up afresh. 

„А few years ago, when the British Psychological Society proposed to establish a journal of statis- 
tical psychology, Burt, with the help of several research students, began a series of preliminary 
Investigations on legibility and typographical preferences among educated adults; and the more 
recent decision of the Society to reconsider its choice of printer has furnished an additional reason for 
ascertaining the views of potential readers and subscribers to this journal. The appeal inserted in 
the last issue has elicited numerous comments and replies ; and we should like to thank our corres- 
Pondents for their help. With the aid of teachers and others, we have continued and considerably 
extended the earlier experiments on school children. The account that follows will summarize, 
as succinctly as possible, the main conclusions that have so far emerged. As so often happens, 
One of the most valuable results has been to show where the methods adopted were at fault and to 
indicate more effective lines of inquiry. 


Methods. To determine the legibility of type, a large variety of methods have been 
employed by previous investigators. The commonest may be broadly classified as follows : 


1. Ease of reading letters, words, or sentences, judged by the distance at which they can be read. 
2. Accuracy of reading letters or words with brief tachistoscopic exposures. 
»3 3. Speed of reading passages of prose, when the reader's aim is to grasp the content of the 
Ssages. 
4. The observation of eye-movements, eye-blinking, and other objective symptoms. 


There can, we think, be little doubt that the results obtained often depend very largely 
On the experimental procedure used. For example, types that appear more legible than others 
When tested by the distance method may prove less legible when tested by speed of reading. 
. We ourselves found the first two methods quite effective for investigating the legibility 
of isolated letters or figures, particularly where the shape is apt to differ according to the style 
Of type. For our main purpose, however, which was to determine the legibility, not of 
isolated words or characters, but of consecutive sentences, such procedures proved inappro- 
Priate. With children the study of eye-movements, fixation-time, blinking, and the like was 
at times informative, especially in the case of young or backward readers, and of those who 
Suffered from minor visual defects. But on the whole, its value was merely supplementary ; 
It might often explain why certain styles of printing were unsuitable, but it did not afford 
a practicable basis for systematic comparison. The method therefore on which we have 
Principally relied consisted in getting each reader to read silently a page of simple consecutive 
Prose, timing his speed, and then determining by means of a questionnaire how much of the 
details had been grasped. Е Ta^ 
; For the preliminary experiments children were tested in class, and a fixed time-limit was 
Imposed. For tlie final experiments individual testing was adopted, and proved far superior 
to group testing. Every child was asked to read aloud an entire page, which had been 
evised to form a unit complete in itself; the time required was measured with a stop- 
Watch, and the questions answered огаПу.! At the same time a careful look-out was kept 
9r pauses, irregular eye-movements, and other signs of difficulty or fatigue. 
introduced an important modification. We became 


In the later stages of our work we i ion. l 
radually convinced that short tests, such as have been used by many previous investigators, 


Can often be quite uninformative and at times positively misleading. In changing to a fresh 
or unfamiliar mel the reader needs time to become adequately adapted. Moreover, 
erences in legibility and fatiguability may not appear during the first few minutes: many 
aders who can read 8-point as accurately and as quickly as 10-point, provided only a 
rief paragraph has to be read, often show signs of strain with the smaller types after they 
ave been reading one or two pages. Accordingly, instead of handing the reader the essential 
€st-passage straightaway, we first gave him a preliminary booklet printed in the same style 
9f lype and kept him reading it for 5 minutes, before subjecting him to the final test. 


Son 1 
Violet Benl experience in an Institu im 
ing. In th e recent experime the 
oper and by Miss J L. Martin, Who therefore appear as joint authors. 
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Influence of (a) Type Faces. The legibility of a page of print is the resultant of many 
different factors—the size, the form, the thickness or * boldness ° of the letters, the width of 
the line, the distance between successive lines, the texture of the paper, and the intrinsic 
interest of the subject-matter itself. We have carried out experiments on each of these 
variables in turn. " 

We began by trying to assess the relative legibility of the different book-faces that are now In 
common use. For this the experimental plan ultimately adopted was similar to that described in the 
investigation on reading carried out in conjunction with Mr. B. Lewis. The method then uscd 
was virtually a combination of randomization with a scheme of Latin squares ; but certain minor 
modifications were introduced to meet the special requirements of psychological research. For the 
present inquiry ten groups, each consisting of 15 boys and 15 girls aged approximately 10:0 to 11-0 
years, were selected by means of preliminary tests, the members being carefully matched so that the 
means and standard deviations of each group were approximately the same as regards both general 
intelligence and reading ability. Thirty short stories, each about a page in length, were constructed, 
and set up in 10 different type-faces—all roman—with 3 sizes for each (9-, 10-, and 11-point for the 
larger faces, 10-, 11-, and 12-point for the smaller). For each passage a questionnaire of 25 questions 
was drawn up ; and the drafts for both stories and questions were tried out first of all, and where 
necessary amended, until they appeared to be about equal in difficulty. In the final experiment every 
pupil was asked to read 10 passages, one in each style of type, the different stories and the different Sis 
being allotted according to the randomized scheme.2 The preparatory trials had shown that, айе 
the first test had been taken, the order in which such passages were read made little difference to th 
accuracy of the reading. Accordingly, the main experiment started with a preliminary stong 
used merely to provide a certain amount of practice. A short series of supplementary experimen 
weré made with italic type-faces ; but for these the groups were somewhat smaller. 


TABLE 1. SPEED AND COMPREHENSION WITH DIFFERENT TYPE FACES 


Roman Italic 
| ; 
Type Face Time Comprehension Time Comprehension 
(Seconds) (Items) (Seconds) (Items) 
E d 
Old Style (161) 96 28-0 = =i 
Imprint (101) | 97 29-1 113 269 
Times New Roman (327) 102 27-9 118 258 
Plantin (110) 105 26-8 129 250 
Bembo (270) 106 26-5 135 23.7 
Baskerville (169) | 108 27-0 137 24:8 
Caslon (128) 12 27-6 138 25-0 
Scotch (46) 114 25-9 108 27:3 
Modern Extended No. 1 (7) | —— 116 25.8 116 26: 
Bodoni (135) 125 25-1 124 258 
The numbers in brackets refer to the serial numbers of the M ype Ci ation. The chief difference 
between the type-faces used are illustrated by the REED Tette printed in the appendix (рр. 45-57). 
| їп 
Most investigators (cf. 22. pp. 114 and refs.) describe the si: for their per 
t investigators ‚рр. 1 .) di е size of the type used for {Пё ints 
ePi content Gin. But the x-height of different faces of the same nominal size in hat of 


bly : for example, with 10-point that of Times New Ri is 0:062 in. ; izes 

y 3 отап 15 :nt-SIZ€ 

Bembo only 0-050 in. To allow for these divergences the figures obtained with different point nt 
rama d ty ere and nen averaged, so as to reduce all to terms of the атр "n d 
,0- . (that of 11-point Impri i da 
and accuracy, are E in Table T ani Moceniga TRE нна " 


! Brit. J. 1 laine 
in the fooinotes on HDI aise ВО; 038. Тае 1959015 for the modifications are there XP 00 
* Most of our reading-passages have been set in SAS Dê, But in our earliest experiments a fes 1, 
In the earlier stages we also used Modern No: 1, these we 
T € faces, cut in 1900 and 1901 respectively. ir 
ond, Fournier, Ehrhardt, and Gill Sans: but for 
eight type-faces eventually selected. 


CYRIL Burt, W. F. Cooper, and J. L. MARTIN 


The data was subjected to an analysis of variance in the usual way. The variance of the means 
of the different groups, and the variances of the means for the different passages, were statistically 
non-significant ; the variance of the means for the different type-faces was fully significant. The 
differences between adjacent pairs are too small to be significant statistically ; but it will be observed 
that the order of the type faces in all four columns shows a fair amount of agreement. 

The British Association Committee recommended old faces rather than modern, 
and gave Caslon the preference. In our own experiments we found both Old Style and 
Imprint far more satisfactory than Caslon. (Old Style and Imprint may be regarded as 
modernized versions of Caslon, with its more peculiar features removed—notably the 
sprawling capitals which tend to distort the familiar word-form.) As regards roman types 
(i.e., the upright letters now ordinarily used for consecutive prose), Imprint appears on the 
whole to be the best ; Plantin! and Times New Roman are nearly as good, although with the 
Younger children their short ascenders and descenders seem at times to render the word- 
form less distinctive.2 И 

The modern type faces were the least legible. As the reader will see if he compares 
‘n’ and ‘u’ in (say) Garamond and Bodoni respectively, the older faces accentuate those 
Parts that are different, while the modern faces accentuate those that are similar. 
With the latter, therefore, letters are far more readily confused. Legibility is still further 
diminished by the excessive disparity between the thick vertical strokes and the thin 
Curves and slanting strokes: to a child who is slightly hypermetropic or astigmatic, 
a word like * minimum’ printed in Bodoni looks like a succession of i’s, and the whole 
line gives the impression of a palisade. Condensed faces (such as are seen in many French 
reading books) are also liable to look blurred to the hypermetropic eye ; on the other hand, 
excessively expanded faces (such as are occasionally found in American books) tend to disrupt 
the word form. 

Some of the commoner type-faces present special difficulties to children because of the unusual 
Shapes of certain letters. The lower serif or beak on the capital C in Bodoni, Baskerville, and italic 

aslon, causes many to mistake it for G. Capital Q in Imprint and Times Roman is sometimes 
mistaken for О. The old-fashioned Y in italic Caslon and Baskerville puzzled many young readers. 

he barred italic J of Caslon, Imprint, Plantin and Baskerville was read by many children as * f" ; 
and the italic л of Garamond and one or two other old faces tends to be read as ‘b°. In 
Caslon italic the extreme slope of the letters and the wide separation of many capital initials from the 
Test of the word also caused confusion. On the other hand, we found that, particularly with very 
Young children, the modern face R (with its curling leg) and Q (with its tail beginning inside the 
Counter) were more readily recognized than the corresponding old face capitals. The larger eye ofthe 
modern e makes for greater legibility : children with imperfect vision sometimes mistake a Baskerville 
Or Caslon e for c. The modern faces have one further advantage frequently noted by teachers : 
the ranging numerals are said to be much easier for children (those of Scotch Roman and Times 

Oman appear to be easiest of all) ; whereas the ascending and descending numerals of the older 
faces occasionally produce an appreciable hesitation, particularly when the figures are grouped. 
Here as elsewhere, however, much depends upon the style and shapes to which the child has been 
accustomed. For young children, we are firmly convinced, there would be a great advantage in 
Selecting a single legible set of letter-shapes, keeping strictly to these, and allowing no picturesque 
divergences, For them too, displayed material, such as titles of stories, headings to chapters, and 
the like, should be set, not in capitals, but in lower case with capital initials. 

We attempted similar experiments with adults (chiefly students and other educated readers) 
Confining ourselves for the most part to seven type-faces only (Imprint, Times, Baskerville, 


р 1 А smooth hard paper was used: Genesee a Suan to its heavier impression, Plantin would have 
Toved t i Fall: (see section on * Boldness °). à 

С c amis long descenders is now available. We may add that, although the 
long ascenders and descenders adopted by most * modern ' faces tend to increase the legibility of the isolated 
letters for younger children they spoil the characteristic unity of the words for older readers. 

3 Of the semi-modern type-faces, * Century ' (a type used for a popular series of schoo! readers) appears 
9n the whole to be the most readable. But its merits were not great enough for us to include it in our final 
Series (See Monotype Recorder, XXXIV, 1935, рр. 18 f., and David Thomas, School Books and their 

YPOgraphy,' Printi ev эней. 34, pp. 5-8). И * А 
fi BE NES d КУП Bembo, are now provided with an alternative set of ranging 

ures. We believe, however, that the question has generally been oversimplified and the effect of habit 
orally overlooked. ^ Certainly the descending 3 and 5 of Plantin and Imprint are easily confused. On the 
Sther hand, with lining figures 3 and 8 are confused, especially in reading small-type mathematical tables. 
Thus, We are inclined to think that, for tables like those of Chambers, a slightly ascending 6 and 8 would 
aid discrimination especially when ‘there is а chance of the plates getting worn. In our opinion, however, 
the whole subject Calls for further investigation by up-to-date psychological techniques. 


„э 
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o, Old Face, Scotch, and Modern). Unfortunately, it was impossible to plan the re- 
€: as systematically as before or to test such large numbers. Тһе results are theref or по: 
worth reporting in any detail. With adults, we found 10-point Times Roman, O dd 
more legible than 10-point Bembo or Caslon ; but this is mainly due to the fact tha iners 
former the x-height is much larger. When allowance is made for differences qt = ne 
there is, for persons of normal eyesight, virtually no difference in legibility betwe Ace 
commoner book faces. For older persons and for those with visual defects, Ope а. 
varying widely in the thickness of their lines and (to a less extent) those with Бан y Lien a 
tion at all seem the least legible. For such persons the general order of legibility арр а SP 
be much the same as for children. The chief exception is Modern No. 1 which сеа 
less difficulty to adults, particularly if (like Science students) they are already fairly accus 
to it. : ا‎ 

(b) Boldness. For readers who are hypermetropic or astigmatic, legibility 1S ION E 
by increasing the heaviness 1 of the type. But the optimal thickening is severely MS nize 
a point which has too often been overlooked : excessive thickening tends to a ee ly 
of the * counters ’ (i.e. the white inner spaces) in such letters as‘a’ and е’. А mode leer i. 
bold or heavy type seems definitely preferable in books intended either for the Vallic 
or the very elderly. For the normal eye of youth there is little gain but rather t ү Торт 

In several type faces the boldness is achieved mainly by thickening the vertical stro nonai? 
instance in Times Bold (328): Times New Roman Semi-Bold (421) avoids these gisprapo a (110) 
differences, but the set is a little too narrow especially in the larger sizes. With children ри ie 
owes its legibility partly to the fact that it leaves a somewhat darker impression, especially on a lower 
paper: but the peculiarities of the capital C and W, and the splayed capital M, as well аа е теа 
case ј, are unsuited to the younger readers. For them the most legible fount would E a ets to. 
semi-bold * Old Style,’ e.g., Monotype Old Style Antique (161)—a type design 2 which no se of the 
be seldom used for ordinary book work. For elderly readers we found Veronese (59)—0 
earliest of the machine type-faces to aim at aesthetic quality 3—apparently the least tiring. f size. 

(c) Size. Our next problem was to investigate the influence on legibility a deen 
We began with Old Style Antique. However, in producing books for youn а d that 
printers and publishers still use a wide variety of type faces ; and we therefore exi nings 
norms stated in terms of one of the more widely available founts, with no marked g morë 
toward any exceptional characteristics either in design or in heaviness, woul dingly, 

useful than norms stated in terms of a type that is now comparatively infrequent. Accor on 
after some consultation with teachers and publishers, we eventually determined to bas 
norms on Times New Roman. , :ye with 

The British Association Committee gives requirements in terms of minimum ае, ВС 
specimens in Caslon. We found, however, that, particularly with older children, а pr 00 
might err by being too large as well as too small.4 Hence we preferred to formu shown 
standards in terms of optimum sizes. The results 5 of our most recent experiments are aller 
in Table П. If a semi-bold type is used instead, the optimum size would be slightly ery 
for ages 8 to 10 and slightly larger for ages above 12. We would add that, for 


" in Type 
! For recent studies on this point, see M. Luckiesh and Р. К. Moss, * Boldness as а FiteadabilitY ， 
Design and Typography ’, J. Appl. Psychol., XXIV, 1940, рр. 170-83, and id., ‘ Criteria о ош degree? 
J. Exp. Psychol., XXVII, 1940, pp. 256-70. The writers experimented with Memphis type 0 
of heaviness: but the differences found were small and in our view of doubtful significance. uished from 
2 The term * antique ’ refers, not to the design, but to a particular kind of heavy face, disting 
that called * clarendon ' chiefly by the treatment of the serifs and by a wider set. _ ldine H 
? If the reader will glance at (say) the prefaces to pocket Shakespeares published from A ful face 
he will see how surprisingly legible even the tinier founts can be. But these somewhat colour! 
better suited to classical or poetic productions than to scientific work. mits of the young 
* Kerr points out that the largest of the large letters should in no way approach the limits О letter 


* apt yn n 
reader's area for distinct vision. In the British Association Report the largest of the арго ae 
intended for beginners (who were apparently supposed to read letter by letter) was 0°16 in. d ft 
observes that, at the ordinary readi 


1 e 
sng th ng distance, this would be about one quarter of the vertical periments 
area of distinct vision. However, his estimate for this area appears to be derived from early e reducti с 
on adult vision, and greatly exaggerates the vertical span for the small child. But in any Cases imates ¥ 
determinations of this kind appear to us to be decidedly precarious; and from more direct es ed 
believe that the maximum has commonly been overrated. 3 rts Grade 
> А rough notion of the effect of these requirements can be obtained by glancing at Bu to, 


5 Р d ce 
Reading Test (Test 1, Mental and Scholastic Tests, pp. 340-1) which is intended for children age type fa e 
and was based on an earlier set of norms differing but slightly kom those set out above. The same Y mitte 
has also been used in the Standardized 


(1954) Attainment Tests recently issued by the Kent Education 


ouse， 
s are 
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Young (aged 9 or below), the words should be well spaced out 一 with an em-space as a mini- 
mum instead of an en-space as a maximum : and thin spaces should not be used in books 
Intended for children. 


TABLE H. TYPOGRAPHICAL STANDARDS FOR CHILDREN'S READING BOOKS 


iere ure ЕТ E aT, Number of Length of |Interlinear Number of 
Age Type-face Type-body x-height ү cttersina Lines | Space Lines per 
(points) (points) | (inches) Lineof4in， (inches) | (inches) Vertical 5 in. 


24 30 0-150 30 5 0:250 12 
18 20 0-111 | ; 4 0-170 18 
16 18 0-090 3i 0-150 20 
14 15-16 0-075 3i 0-130 24 
12 14 0-068 Ы 4 0-110 28 
11 12 0-060 | 60 41 0-100 30 


In Column 4, * letters? means lower case characters or spaces, the space between two words being 
counted as one ‘letter’. 


For adults we used 8-, 9-, 10-, 11-, 12-, and 14-point Times Roman. With this face 
Set solid, the most legible size was 10-point with an x-height of 0:062 in.1 Judging by the 
Average figures, any diminution in size (even by one point) and any great increase in size 
(two points or more) tend to reduce legibility. Although large letters are the most legible 
When read singly, they by no means favour quick reading or quick comprehension when used 
Or consecutive prose: they cause visual attention to be directed to the letters themselves 
Tather than to the form of the words, and to single words rather than to groups or phrases : 
§ € bigger the type, the smaller the amount of reading-matter falling within the normal eye- 
Pan, and the larger the number of eye-movements and fixation-pauses. 


There were, howev arked differences in optimum size from one person to another, for the most 
ПШ related to differences in visual acuity. With individual readers possessing horal eyesight 
She effects of small changes in size were insignificant, Older persons. even with apparent yorma 
Маһ, Often found 11-point more legible than l0-point, The majority of rnm Suh CES 26) 
and 9-point equally legible. But with presbyopic readers 9-point was apt to p ч г pre mulos 
EU lo induce eye-strain when reading was prolonged : these effects were ev! m Ard y int A 

n ts of the questionnaire, but also in the irregularity of the eye-movements and in the increase 
king which became obvious to the investigator as he watched the reader. 


(d) Leading. The introduction of * leads ’ (i.e. thin metal strips inserted to widen spaces 
Весел the printéd lines) greatly enhances the legibility of small type. In particular, it aids 
Пе eye to pick up the right lines as it moves back from the end of one line to the beginning of 
a © Next—a point of special importance with children who are extremely prone to doubling 
i 9 skipping. In the table above, the * interlinear space г is measured from the base-line, 
16. the bottom of the non-descending letters, to the top of the non-ascending letters of the 
Aa next below, Roughly speaking, the distance shonn be aoii ИШЕ LOREM e fe 
eight 一 rather i hildren, rather less with older wi 
more with younger С , Sn 
Wide measures rather less with narrow, in general, say about one-thirtieth of the measure. 
tha En the type used is large on its body or has long ascenders and descenders, it is essential 
hat the leading should show a clear separation between the two lines especially when a 
Scender comes directly above an ascender. Me "d i 
, In this j ; " man are used for the text—10-, 2-, and, -point, all set 
solid, yg fournal mee sizes of Times P eoduction of one or two points of leading appreciably 
(Crease, thee Ne es ding (With type-faces that are smaller. ой Ше gody, ike Bo or Cason, 
Point арр; cia ; ms to be gained by 3-point leading. 4-point leading 
mually о аа рае letter-size, it tends to increase the number of eye- 
ma ents and fix de ЧУ. Plainly, the use of generous spacing can easily be overdone. as 
апу readers have probably felt in perusing the first editions of the works of Ruskin or Tennyson. 
1 This is roughly equivalent to 11-point Baskerville and 12-point Bembo. 
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(e) Measure. Table II indicates what we consider to be the most suitable length of line 
for children of varying ages. The limiting condition is a matter not so much of visible 
inches but rather of the number of letters. With adults our experiments were in the main 
restricted to reading matter set in 10-point Times Roman. With this type, measures! 
shorter than 20 ems or longer than 33 ems diminished speed and ease of reading. Very 
roughly, these are limits of 33 to 5} in., 55 to 80 characters and spaces, or 2 to 3 lower case 
alphabets.? For literary material the narrower measure is desirable (say 3} to 4 in.) ; for 
scientific the wider. For a scientific journal intended for highly educated readers, who of 
course tend to skim rather than read word by word, a measure of 5 in. seems preferable. 

With long lines of solid type the eye finds it difficult to pick up the right line in turning from the 
end of a given line to the beginning of the next; and large pages filled with solid-looking panels 
of printed matter are apt to repel all but the hardened scholar. On the other hand, short measures. 
particularly when the type is big, prevent the eye of the trained reader from taking in large phrases 
with a single fixation and from making the most of the subsidiary help given in the horizontal 
direction by peripheral vision. Moreover, they necessarily entail widely varying spaces between 
the words, and increase the number of broken words at the end of the lines—features which арргес!- 
ably hinder comfortable reading.4 

Where the reading matter requires to be read mentally word by word (as in poetry), small 
measures, wide interlinear spacing, and even small type with fairly broad spaces between the wor 5, 
аге an advantage.5 At the same time, small type further aids the eye to appreciate the shape of the 
stanza, especially where the metric structure is at all complex. 


(f) Margins. There can be little doubt that books with excessively narrow margins 
are more apt to produce visual fatigue, though the diminished weight of the volume may 
then perhaps slightly lessen manual fatigue. Moreover, when the type-area extends nearly 
to the edge of the paper, the eye of the younger reader is apt to swing right off the page. 
With adults the effect of the broader margin would seem to be chiefly aesthetic. 

. To gain light on this problem we carried out a number of experiments upon readers’ preferences 
in regard to the general imposition of the type upon the page. The results varied widely accor “е 
to the subject matter of the book (literary or scientific), the size and shape of the page, the size, Sty!" 
and interlinear spacing of the type, and so forth. To simplify the task we confined ourselves mainly 
to volumes whose pages showed proportions approximating to those of the golden section (1 : Mv5— 2 
i.e., 1-000 : 0-618) : this covers most of the ordinary octavo sizes. Expressing the margins as pe s 
centages of the measure, the weighted average of the preferred specimens gave the following PS 
portions (rounded for convenience) : (i) inner margin 17-5, (ii) head 22:75, (iii) outer margin ғ be 
(iv) foot 32:5. These are very nearly 4%, a's, 15, and 1% respectively. The total of all four, it will b 
observed, adds up to the size of the measure. as 

Let us consider how this would work out if applied to a demy octavo publication (such ® 
Spearman's Abilities, Thomson’s Factorial Ability, Burts Factors of the Mind, or the journal Min ЧУ. 
The dimensions of the trimmed page would be about 5} x 8} іп. ; and the foregoing recommendatior 


in terms of 
e correctly: 


designated a * pica’; e.g., [22], p. 114). habets， 

__? The last is a convenient mode of statement, since most specimen sheets give the length of the alp! 1 nt in 

With types of other sizes the same alphabetic rule will serve. If a large body is used (or the АШЫ? point 
t with 2 


ortane 
s imp from à 


ds оп 


А " " 5 . : а ize B 
3A line of 5 in. implies a page of about 71 in. wide, i.e., crown quarto. For books an octavo Bk 
more convenient. essen 
EE On the basis ofan admirably planned series of experiments Paterson and Tinker summarize ihe hort 
tial results as follows: * One may characterize the oculo-motor patterns in reading an excessively ation 
line by saying that the number of fixations is increased, the span of perception decreased, the mean ements 
of ixations increased, total perception time greatly increased, though the number of regressive mov? pack 
remains approximately the. same. .. . An excessively long line gives major difficulty in swinging | Eye 
to the beginning of successive lines’ (D. G. Paterson and M. A. Tinker, ` Influence of Line Width 88 пит. 
Movements aie кр Ао, XXVII 1942, pp. 574-6). Their optimum range for 10 point type (80 dins 

"ies rather narrower measures than tha: ed here. But otherwise 
seem perfectly consistent with theirs. плаш ith 8 
5 One of us has an early edition of Tennyson's Princess in i int) with © 
i 7 cess in a long primer (roughly 10-poin 
leading. Certainly the poet successfully compels his readers to read ЧОМУ ! 
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would imply a measure of 22 ems or 33 in. anda depth of 38 ems or 6} in. (i.e. a type-area somewhat 
paler than was actually adopted for the publications mentioned). With 10-point Times New 

горап set опа 12-point body, each page would then contain 38 lines, i.e., about 2100 characters or 
E tail” Tein ihe margins would be : (i) backs, $ in., (ii) head 7 in., (iii), fore-edge, 1 in., (iv) foot or 

Roughly speaking the two side margins should together take up a third of the entire width 
of the page, i.e., half the measure. When the interlinear spacing is increased, all the margins should 
be increased. For young children the proportions should be much wider, allowing plenty of room 
at the bottom for the thumb and at the sides for the untrained суе to swing to and fro. For infants 
the line should end with the end of a phrase, leaving the margins irregular. 

For highly technical works a larger page may often be desirable, e.g., royal octavo (91 x 6 in.), 
and somewhat narrower margins can be tolerated, and indeed, in the view of most readers, would 
even be preferable. For a technical journal a broader page is desirable, e.g., crown quarto 
(10 x 7 in, allowing a measure of 5 in.).2 

„The Interaction of these Factors. In the course of all these experiments it soon became 
Obvious that the procedure adopted by most psychological investigators, namely, studying 
the effects of the four or five main variables separately—size, leading, line-length, style, 
апа heaviness of face—is quite inconclusive, as indeed the practice of every good printer 
Should suffice to show. For example, a line of 5 inches is easier to read than one of 3 inches 
Provided 10-point or 11-point is used : when 8-point is used, the reverse is true. 9-point 
type, set solid in a measure of 5 inches, is harder to read than 10-point : but, set with 2-point 
eading, it is just as easy, at least for the average adult ; and it is almost as easy in a measure 
of 3 inches if only I-point leading is used.  3-point leading seems disadvantageous with 
Ordinary faces ; but with comparatively heavy type, and a face that is wide rather than con- 
densed,3 it is of definite assistance to children. 

Practical Corollaries. In summarizing the foregoing results, it may be of interest to 
Consider them more particularly in relation to journals such as those published by the 
British Psychological Society. It should again be emphasised that our conclusions are 
themselves merely tentative, and require further verification : other factors besides those we 
have studied here may conceivably have an overriding importance. 


" , Were we guided by the data so far adduced, we should be tempted to infer that the length of line 
(51 in.) which is adopted for the British Journal of Psychology (i.e., what until recently was called the 
eneral Section °) is, if anything, rather too wide for the size of type employed; that adopted for 
Ше Present journal (5 in.) is satisfactory for academic readers; that adopted for the British Journal of 
Educational Ps chology (4% in.) is well suited for a journal of a slightly more popular kind; while 
Ne short lines in.) recently introduced by the British Journal of Medical Psychology, which is set 
With two columns on a page, are likely to diminish speed of reading, though they may possess other 
advantages of their own. Much, however, must depend on the style of printing to which the reader 
15 accustomed. And in books and periodicals dealing with advanced mathematics the more claborate 
algebraic equations would of necessity be badly broken up if set in columns of 25 in.; indeed, it 
Was for this reason,that the editors of this journal at the very outset chose a rather wide measure. 


4, 1 It was notew at elderly readers (including the oldest of the present writers) strongly preferred 

4 much larger heed т E doubt as a result of preferences acquired when such margins were more 

fashionable. This would incidentally keep the proportions of the printed panel, as well as those of the page, 
ar to the golden section. А " T" 

In work k ion it is often stated that diagonals uniting the corners of the page should 
also pass ed baok product! the type area; that the inner margin should be half the outer: and that the 
Proportions should be approximately 11,2, 3, and 4. This mode of imposition satisfied o Sum readers ; 
bnd it was frequently ‘complained that the inner margins were so narrow that, wiih Ss and tightly 

Ound volume, the words nearest the inner edge were hard to read. It may be noted t а ШӨ proportions 
Suggested in the text imply that the printed area will be no more than about 50 per cent. o тео area оѓ 
the Page. The wa ле economy regulations required that the type area should fill at least 58 per cent. 

? The proportions referred for the margins of a crown quarto page (the size of the Society’s journals) 
Were much the same es those cited above, with slightly narrower head and bottom margins. „They fall 
Pidway between those adopted by the Cambridge Press for the Brit. J. Psychol. before and after 1942 (namely, 
1 in., 1°4 in., 1.6 in EH 15 in. with a measure of 4-4 in., giving a type area of 30 sq. in., and 0:6 in., 0-9 in., 
ii in, 12 in. MA of 5-3 in., giving a type area of 42 sq. in.). Note that in all these specifications 

© running headlines and folios are not counted as part of the type arca. Ethel —M— 

.? Times New Roman (327) has a rather narrow set: the w idth of the lower case roman alphabet, 
Set in 8-point (x-height 0-052 in.) is only 105 points, whereas that of Perpetua 11-point, which has practically 
the ame x-hei ЫТ 1 T Ur 11-point Imprint and Modern 7, (both with an x-height of 0-060 in.) 
have an alph 8 , vid 1 Pg 5nd 135 10-point Times Roman with a larger x-height (0-062 in.) has an 
alp] abet width o el 1 124 3 thie condensation makes it, in our view, unsuited to children under the age 
of 10, and caused a number of our readers to deprecate its use for literary publications. 
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For mathematical work the Committee of the Royal Society ! selected three type-faces -Imprint, 
Times New Roman, and Modern 7. For books we are inclined to favour 11-point Imprint (large 
face) on a 12-point body set іп a measure of about 23 ems (3:83 in.), or 25 to 28 ems if there are 
numerous equations. For periodicals this face seems a little too large, while the 10-point scems а 
little too small; and a larger measure would certainly be desirable. On the whole, the most favour- 
able type for such publications appears to be 10-point Times Roman with long descenders and with 
1-point leading : (2-point leading if the older version, with short descenders, is used). For subsidiary 
matter, 9-point with 1 point leading, which is almost equally legible, might be substituted. po 
serious difficulty seemed to be experienced with short footnotes of 8-point, even when set solid, 
particularly when they deal mainly with references or somewhat specialized comments. But a good 
many readers find it hard to cope with 8-point matter running to lines of 5 inches when continued for 
more than half a page.? On all these points we should welcome comments from our readers. 


Introspections. In the course of all these experiments, especially when our examinees 
were adults, we tried to elicit comments and introspections. It was most instructive to observe 
how often the subjective impressions of the readers failed to correspond with their actual 
efficiency. 


Certain readers, for example, reported that they found 12-point type more legible than 10-point, 
others that they found 9-point type quite as legible as 10-point type, when their actual performances 
demonstrated beyond question that they were wholly mistaken. Often they ascribed to the design 
of the type effects that were really due to the size of the type or to the leading. The spacing be n 
the lines and the length of the measure produces illusory estimates of the relative size of the print: 
with both 9- and 10-point type, readers who found that the change from solid type to 2-point leading 
improved their accuracy or speed frequently explained that they found “ the large type much easier 
when the size of the face remain unaltered. Thus, as Mrs. Beatrice Warde has rightly observet $ 
"What the book critic calls readability is not a synonym for what the optician calls legibility."* Near y 
all tended to read with greater facility the kind of types that they preferred, and were inclined to zan 
fuse intrinsic legibility with their private aesthetic preferences. As we have seen, preference depen fi 
largely upon custom; and throughout it seemed evident that almost everyone reads most Cast? 
matter set up in the style and size to which they have become habituated. 


III. AESTHETIC PREFERENCES AND THE 
CLASSIFICATION OF TYPE FACES 


The Study of Preferences. On looking at recent scientific publications both in k^ 
country and America, one is tempted to echo a remark of Bernard Shaw's: “ It is a mista 
to think that the modern author is insensitive to the beauty of a finely designed AES 
Well printed page: he positively hates it.” No doubt war-time regulations, the increasing 
need for economy, and, in technical subjects, the fact that the better designed founts E 
deficient in * mathematical sorts ’ may be in part responsible. But authors and readers e 
themselves largely to blame. The scientific writer is apt to declare that *' print is геп м 
for use not for ornament: its function is to be legible, not to look beautiful ". Scien 
readers, so far as our results can be trusted, vary much more widely than others in 0р of 
graphical taste and interests—the least sensitive usually predominating. And both au 


and reader are quite content to leave such matters to the printer or publisher. T 
ctor" 


The best publishers and typographers are fully alive to the importance of psychological fa Je to 
They recognize, as Oliver Simon puts it, that * there are aesthetic considerations prompting peor fine 
prefer one type rather than another, which are closely linked with the emotions and difficult to flate 
Precisely [21, p. 11]. In work on book production and in the journals of the trade there has arjiest 
been much discussion about the artistic qualities of different type-faces; and, from the © ning 
days of printing, sculptors, architects, painters, and engravers have applied their skill to the desig ical 
of type. Yet hardly any objective or systematic research has been undertaken on the psycho stiga 
questions involved. The psychologist, though he has studied legibility and made sporadic inve ested 
tions in almost every other branch of art both pure and applied, has remained singularly uninter 


seman” 
! See Notes on the Choice of Type Faces for Scienti iodi ; he Chairs. 
ific Periodicals (1950) drawn up under the © 77. (on: 
F р 5: Zuckerman by the Consultative Committee for Cee aon with Printing Organiza foot- 
RH ependently, Mrs. Beatrice Warde (of the Monotype Corporation) suggested that t redness 
notes mn pe Tornat zu be divided into two columns. The Editor would like to express his inde and fof 
Hee abis per indness in responding to various questions about the printing of the Journ 
3 Monotype Recorder, XXXII (1933), no. 1. 
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in the aesthetic aspects of printing—a form that silently confronts him every day of his life. The 
Problems, moreover, are not without their practicalimportance. This is perhaps most obvious in the 
Psychology of advertising. Here the readers’ reactions, whether conscious or unconscious, towards 
the style of printing adopted by the advertiser may defeat the entire purpose of the advertisement. 

€t once again, though numerous experiments have been carried out on visibility and display, hardly 
any attempt has been made to discover what particular qualities make different kinds of type-faces 
attractive or otherwise to different kinds of public.! 


Methods. In planning a tentative inquiry into the more obvious problems, we have 
followed much the same general procedure as was used by one of us in previous investigations 
on personal taste in literature, art, and music. We collected examples of all the commonest 
lype-faces now in regular use, and then submitted them to a number of persons of both 
Sexes, with the request that they would arrange the various specimens in order of preference. 

'othing was said about the differences being aesthetic. Indeed, it was part of our problem 
to discover how far, if at all, the judgments were comparable with other modes of aesthetic 
appreciation. As in our work with pictorial tests and with projective tests such as the 
Rorschach, ме used two methods of factor analysis—' correlating persons ’ and ‘correlating 

ems 
о (а) Correlations between Persons. We began with what is commonly called * P-technique’, 
1.0. correlating the rankings submitted by the several individuals and then factorizing the 
Correlation table by simple summation. The method was first tried with twenty persons 
Selected to represent different classes of reader. 


Their rankings revealed a moderately large * general factor for persons’, accounting for nearly 
41 per cent. of the total variance. Such a factor implies the presence of a common tendency under- 
ying the different orders; what is its precise nature we shall consider in a moment. The second 
actor was bipolar, and contributed 13 per cent. to the total variance. It subdivided the entire group 
Of persons into two subgroups : their factor measurements indicated that, with few exceptions, those 
In one subgroup tended to prefer * old ’ faces, while those in the other preferred ‘ modern'. However, 
а sample of only twenty persons is scarcely sufficient to establish such a classification with any finality. 
everal smaller factors were also discernible, each contributing about 8 to 12 per cent. to the total 
але : for the most part they appeared to relate to such influences as аве, eyesight, and habitual 
"ре of reading. А 2 
Introspections. All our subjects were asked to give reasons for their preferences. With 
typographical material the replies seem far less self-conscious—far less influenced by the 
natural desire to show up well, which so often impairs the sincerity of the responses to tests 
Of pictorial taste or personality. Yet the content of the remarks is so illuminating that the 
Whole procedure almost takes the character of a camouflaged * projective test’. Only one of 
Our critics had any technical knowledge of typographical differences ; but many of them, 
Particularly the book-lovers, seemed sensitive to what one of them called the * atmosphere ` 
Set up by the type adopted. ' 
, The reasons offered could be readily classified into much the same psychological cate- 
Bories as were reached in our previous tests with other kinds of aesthetic material. This 
Will be evident from the extracts given in the appendix, but may be briefly illustrated here. 


(a) Subjective. 1. Associative. (Of Modern 7 and Bodoni) : “І have always disliked that kind 

Of type because it reminds me of the print used in the French books we were compelled to read at 

ausanne—the most hateful time of my life." (Of Caslon) : “It seems to bring back memories 

. уе good olde days '—reading 18th century editions of Swift and Defoe in my father’s library." 
(Of Perpetua italic) : ‘* It reminds me of inscriptions on tombstones or memorial tablets in a church. 

Of Modern 7): “That for me is the only proper type for mathematical work, probably because 

Гуе always seen it in that kind of type." “ With types it's much the same as with the place you live 

In or the author you are reading: they're all so much more interesting when you know something 


about their history.” 


; chology, nearly twenty-five years ago, one of us took part 
їп the advertisements of a wel E Поп of drig aa Uecker. 
й A 5 xpectedly wide. Still mo: iking are the results we have 
Obtained ences in apparent А See group. Though the data so far available are 
Somewhat i Wer to avoid the conclusion that in the commercial world taste and fashion 
Change far mite » ТЕП i noa atically than in the world of literature or journalism, and that on the whole 
they Have 20/9 T Dh y CUL TS A the field of psychology they have definitely declined. 0 

° See С B t 5 |. pp. 280-8 : cf. also the classification of types based by Bullough and others on intro- 
SPeative eat [ рр. ing colour-prefereeces and the like: E. Bu A Brit. J. Psychol., M, 1907, 
P TIL CON рані оГ Psychology of Beant» (1919). 


in a p^t the National Institute of Indu 
The qag2tch on the letter-press to be use 
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2. Emotional. (Of Bodoni): “It seems to prick and dazzle. Horrible” (Of Caslon): 
* Restful and soothing." “Irritating! Those italics make me want to put them straight, like 
pictures hanging crooked on a wall." (Of Garamond italic with its different angles): * It makes 
me giddy to read it.” 

3. Anthropomorphic. “ A type should have an individual personality: that’s why these 
(Caslon and Garamond) go at the top.” (Of Bodoni): “ Cold, stiff, and rigid, like a row of black 
shirts on parade." “ Intellectual.” “Introverted.” (Of Caslon): “ Homely, unassuming. | 
(Of Fournier): “ Whimsical.” (Of Baskerville): “ Genteel.” (Of Perpetua): " Aristocratic. 
(A general remark): “ A type should have no personality of its own: you expect handwriting to 
reveal something of the writer’s character, but a printing machine has no character to reveal. Now 
these types seem to be saying * Look at me ; aren't I ingenious [Baskerville caps]? Aren't I dignified 
[Bodoni]? Aren't I quaintly old-fashioned [Caslon]? Each is trying to express itself when it ought 
to be expressing the author’s meaning.” 


(b) Objective. 1. Intuitive. (Of Bodoni): “It gives you a general impression of an orderly 
formal scheme—straight verticals on a straight horizontal, everything correctly fashioned and every" 
thing correctly finished." (Of Caslon): “І placed it first, not for any particular features that I 
can single out, but for its general look—its Gestalt-quality, if you like : it forms just the right kind 
of pattern made of just the right kind of shapes." (Of Perpetua): “ Every letter strikes you at once 
as well-proportioned, like the columns, doors, and windows in an Adam fagade." 


. . 2(a). Rationalizations. “I'm an arrant functionalist : to me a letter looks well when it's Чота 
its job well—no frills or fancy touches." (Of Scotch Roman) : “ Letters to be printed with moveat e 
types, instead of written by hand, should be all of the same size! ; these come nearest of those you 
have shown me. On the other hand, in this passage (Times Roman) the S and the J are far too 
narrow ; they should be the same breadth as the U; and the W is too broad—it would fit betters 
the strokes were crossed like that " (the Bodoni W). (A general remark): “In my view. the 
principle of parsimony requires that letters should be constructed of a minimum equipment of lines ч 
three kinds of straight strokes—vertical, horizontal, oblique, and two kinds of circle—full size zd 
and Q, and half size in S and B, with both halves exactly equal. Evidently, then, letters like S ane | 
should be exactly half the width of O. N should be a perfect square ; and letters life E and F exactly 
half the width of N. E should be two small squares—cross strokes allequal.? This (Centaur) com 
closest to my own ideal. The more they violate these principles, the lower the types go in my Бе 
The proper way to judge them would be to measure their dimensions under the microscope. 
speeding: 

Most of our readers sought to judge all type faces by the same fundamental criterion, differing 
from one reader to another. There were, however, a few exceptions. Several of the more т 
asked if they could alter their rank according to the subject matter for which the type was to be yon 
Four insisted that it was impossible to suppose that one and the same rank-order would hold for evie 
kind of publication. ‘Old’ faces and ornamental details (like the swash capitals in Baskerv! 2 
italic, or the unusual К, z, and О in Garamond) appealed to two readers as suitable for poetry N 
historical romances, but were pronounced quite inappropriate for a daily paper or a scientific journa Р 
Three remarked that they liked the reprint of a classic to be set in a style suggestive of its репор. 
One put the principle still more explicitly : * I should like a plain machine-made face for scienti 
books and articles, a slightly more artistic style for what they call belles-lettres, a decorative typ? 
imaginative prose, and a slightly archaic form for the older writers or stories of bygone days; i ose 
I should hate to see Pater or Malory set up in this type (Bodoni), or Kelly’s Statistics in either of th 
(Perpetua and Garamond). There's nothing intrinsically bad about any of them. Aptness 
only criterion.” 

Nevertheless, an explicit recognition of these different functions was comparatively r 
among our more literary readers. It was, indeed, surprising to note how few appeared in any €% 
typographically conscious. Many started straightaway with some deprecatory remark, to ihe? 
that " all kinds of type look much the same to me”. Yet, after a few moments’ scrutiny, abou! © 
in three would become either wholesale critics or ardent admirers of this or that set of feature? 
some particular group of faces. 


(b) Correlations between Type Faces. To investigate the nature of these differ e 
preferences we compiled a special set of test-material consisting of extracts cut from pri” 


are, even 
degree 
(есі 
one 
in 


ential 


А А ‘ciple 

_ 1 The idea that every printed letter should fit into a rectangle ame size and shape was а prine be 
EIS AUR explicitly adopted by many of our readers. Here i uen ded of the typewriter may : 

^ E 3 iow 

_? It was surprising to discover that even our psychological critics were quite unfamiliar with the cu ily 
optical illusion whereby the top halves of the S, 8, ètc., appear much larger than they are (as may be nicallY 
seen on turning the page upside down). If, therefore (as this reader demands), the letter S was mecha ously 
constructed so that the top curve had exactly the same size as the lower, the top curve would, CU 
enough, /ook very much larger. ia 
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specimen books, keeping so far as possible to the same or similar prose passages for all 
type-faces, but including in every instance a complete alphabet, in upper and lower case, 
both roman and italic. We selected type with an x-height of 0-062 in. or thereabouts 
(10-point for the larger faces, e.g., Times New Roman, 11-point for the smaller, e.g. Imprint 
and Modern 7)! These were submitted to 93 judges—teachers, students, University 
lecturers, and educated readers of the commercial or industrial classes. Most of them also 
took part in short tests of pictorial and of literary appreciation : (those described by Dewar 
and Williams [17, 18] were used). 

A group of nearly a hundred is too large for P-technique. We therefore decided to ‘ cor- 
relate tests ’ rather than persons. For this purpose we first converted the rank orders into 
normal deviates ; these were then reduced to deviations about the average for each type-face. 
Finally, we calculated all the inter-correlations between the different type faces, and factorized 
the table so obtained. It shows two peculiarities which have occurred 一 though not with any 
frequency —in other investigations on personal preferences (e.g., in a study of pictorial 
Preferences by Sybil Crane): (i) the largest factor is not a general factor with positive 
saturations throughout, but bipolar?; (ii) after the general and first bipolar have been removed, 
the presence of residual submatrices with figures approximating to zero indicates the presence 
Dot of further bipolars, but of group factors. The final set of factor saturations was obtained 

, In the usual way by arithmetical rotation. The results are set out in Table III. 

In many respects, as will be seen, though not in all, the classification indicated by the 
pattern of siens resembles the broad, semi-historical classification adopted by most books 
Оп typography. To secure further light on the reasons for the divergences and similarities, 
We endeavoured to secure introspections, particularly from typical representatives of the more 
dogmatic critics. Relying mainly on these two lines of approach—quantitative and quali- 
tative 一 we may provisionally put forward the following interpretation of the more con- 
Spicuous factors. 

I. There is first a small general factor which here accounts for only 3 per cent. of the variance, 
More than half its saturations are devoid of statistical significance. Evidently the factor simply 
Indicates that the more general differences between the type-faces were not completely abolished by 
aking an ordinary unweighted average when the rank-orders reduced to deviations. 

II. The largest factor of all is a bipolar factor marking the contrast between (A) Old Face 
types and (B) so-called Modern Faces. Two of the specimens in our series—Baskerville and Times 
New Roman—had saturations which, though positive, were statistically non-significant. 

IIT, After the general and first bipolar factors had been removed, the table of residuals, when 
Appropriately arranged, showed four square submatrices containing fairly large figures, while the 
rest of the figures were nearly all non-significant. To fit this pattern four broad group factors were 
Obviously required. Of these (AI) the first classifies together a set of ‘ old ’ type faces modelled 
Chiefly on Italian or French designs: (two smaller group factors distinguish the two). I shall call 
this group Continental old faces. (A2) The second includes all the British old faces. The other two 
factors subdivide the * modern ' faces. And once again the main division is into (ВІ) Continental 
and (B2) British. Those who are interested in the details of this classification will find them discussed 
In the Appendix. 

Implications of the Factorial Results. We do not suggest that these statistical results 
Make any new contribution to what we may call historical or comparative typography. 

heir implications are of psychological rather than typographical interest. Supplemented 
the introspective comments and biological details obtained from our various readers, the 
ata as a whole throw considerable light on the psychological nature and the apparent causes 
the varying responses which different type faces excited. There are two main questions. 


! This decisi imi Я а ared in our first exploratory studies. Certain sizes аге 

ision eliminates one factor that арреаге k 1 ; 7 

undoubtedly more favourable to certain type-faces. Thus, Imprint, Plantin, and Times New Roman 
Seem most effective in medium sizes: Caslon, Garamond, and Centaur show up best in larger sizes. 

Ince this is itself merely a preliminary inquiry, several important conditions had to be ignored. 


Perha i k a ired fc 
aps the most i i at of paper and ink. Half-tone blocks, such as are required for many 
faentific works need ISSY pos and eu these types like Caslon or Garamond would be quite unsatis- 
ОГУ, whereas Bodoni well. ee 
see Б Бога discussion OSEE рн matrices in which the largest factor is a bipolar, not a general, factor, 
; later and C. Burt, this Journal, IV, 1951, pp. 9-20. 


In theory thi i indicate which particular type face is most representative of the family 
to Which it belongs. "Bat the EM too Sender for much importance to be attached to these minor 
Tences, " 
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TABLE Ш. SATURATION COEFFICIENTS FOR TYPE FACES 


FACTORS 
TYPES Old- 2 
General Mod. Cont. Brit. Ital. Fg Cont. Brit. 
A. Old Face 
1. Continental 
a. Italian б. 
Bembo +173 4-746 4-342. 一 :062 +-433 一 :063 一 :084 —-034 
Veronese +106 4-318 +-182 4-165 十 :226 4-037 —-033 3 
Centaur +127 十 '443 十 '217 十 :144 4-271 一 :042 十 :096 -+116 
b. French 5 
Garamond 1230 +732 +304 一 -058 —024 +446 —028 —045 
Granjon | +204 +-728 +295 +-095 一 :052 十 '424 4-106 c 
Fournier 十 '191 4-581 4-342 一 :107 067 +-519 —-090 —:167 
Plantin 3-212. ”十 '465 2-239 十 :151 十 :105 +350 +-152 Ed 
Ehrhardt +095 4-403 +-378 一 :083 —-043 +-285 —:047 ~1 
2. British 
Caslon +148 +714 一 '113 4-536 --:040 一 :031 — 079 TE 
Baskerville +173 +170 十 '145 2-443 一 :067 4-073 +-181 ub 
Imprint +184 4-429 —-132 4-515 一 :031 一 :045 —075 —' 81 
Old Style 十 '165 ”十 :357 4-107 4-369 +-071 062-036 mi 
Times New Roman +081 十 '187 一 :083 4-270 4-056 —-058 一 :041 +12 
B. Modern Face 
Ts. Continental .059 
Bodoni 1219 —780 4-172 121 4-033 47054 4-834 404 
Didot +175 —660 一 '123 —-032 — 051 十 .167 3 ET 
Walbaum +161 一 "636 十 '160 一 :045 4-046 一 :053 4-367 — 04 
2. British .480 
Bell +102 — 324 —-107 4-118 十 '054 一 '092 —-058 435 
Scotch Roman +346 —542 +191 一 '064 十 -012 十 '129 —-102 25 
Modern 7 上 ,364 715 一 '096 一 '086 一 :038 十 :151 4-085 +3 
Contrib. to Variance (per 5 
cent.) 27 304 68 42 19 54 34 3 


Figures in Bold Type are statistically significant. 


First, what determines the general order of the type faces—their aesthetic quality, B 
legibility, or perhaps some other typographical characteristic? Secondly, what determin 
the individual differences—aesthetic appreciation, visual acuity, or perhaps some kin 
temperamental characteristic? eral 
General Order. The first factor obtained with P-technique may be regarded as à geo 
factor for ‘ typographical taste’. The factor saturations for persons can therefore be 


D 


" n mo е i 0; 
This provides a provisional answer to one of our questions. So far as these results fa 
we may fairly conclude that the differences between type faces on which these prefer 


Orders for Main Group and Subgroups. With P-technique the factor measurements or, 
test-items give a weighted order of aesthetic merit for the type faces used. It will, how®, ce 


sta lic 
In Table IV we give the averages of the individual rankings for both roman and ta 
founts. In addition it seemed advisable to separate our readers into two main £T the 
according to their interests or previous education : for convenience we may call ma for 
‘literary’ and the ‘scientific’ groups respectively. The rankings have been avera£€* the 
each group separately ; and it will be seen that they reveal suggestive differences. г roma 


table the various type faces have been arranged according to the average preference fo 
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founts, since this would seem to provide the best all-purposes order. The weighted order 
given by the general factor for persons is almost identical with the average ranking given 
by the ‘ literary ° group. 


TABLE IV. PREFERENCES FOR DIFFERENT TYPE FACES 


mi Roman Italic Gömbe 
ype Literary Scientific Total | Literary Scientific Total Total 
group group a group group 

Imprint (101) 1 6 1 3 5 3 1 
imes New Roman (327) 7 2 2 8 "i 6 2 
Bembo (270) 3 9 3 2 10 4 4 
Plantin (110) 2 13 4 7 6 3 5 
Baskerville (169) 1 5 5 11 4 9 7 
odern (7) 15 1 6 4 1 1 3 
aslon (128) 6 12 7 6 8 8 8 
Bell (341) 13 4 8 10 3 7 9 
Scotch Roman (46) 16 3 9 | 5 2 2 | 6 
ranjon (Linotype) 8 14 0 | 13 16 Ја | ï 
Centaur (252) 4 18 11 1 13 10 10 
eronese (59) 5 19 12 12 12 13 13 
‘Garamond (156) 9 17 13 9 li 11 12 
Ournier (185) 10 16 14 15 15 15 15 
ld Style (2) 14 11 15 16 9 12 14 
Ehrhardt (453) 12 15 16 14 19 16 | 16 
Didot (520) 17 8 7 |17 18 19 17 
Walbaum (374) 18 7 18 ] 18 17 18 18 
| 19 14 17 19 


odoni (135) 19 10 19 


The Bipolar Factor. The second factor obtained with P-technique, it will be remembered, 
Was a bipolar factor, distinguishing those who prefer old faces from those who prefer modern ; 
and, in accordance with the so-called * reciprocity principle’, the first bipolar factor obtained 
With R-technique is, here as elsewhere, virtually the same as the first bipolar factor already 
Obtained with P-technique. The former—those who (according to the factor measurements) 
Preferred old faces—turn out to be, with few exceptions, students or lecturers in the Faculty 
of arts ; the latter—those who preferred modern faces—include most of the regular readers 
of Scientific or mathematical works, a small proportion of the * literary group’, and the large 
majority of the nen-academic (i.e., commercial or industrial) readers. In these cases, there- 
Г, the individual deviations from the general order of preference appear to be in part 
àt least an effect of habituation ; and the presence of this large bipolar factor implies, or at 
o o is consistent with, the view that different kinds of type are suited to different kinds 

Job. 


There iently striking exceptions to indicate that familiarity 
and habit ms T iw uet s A Б EAE the results of our supplementary tests 
ie instructive, They indicate that a preference for the older type faces is associated with what 1 
faye loosely called * romantic * tendencies in other forms of artistic appreciation, while р 
bet todern type faces is associated with a * classical’ taste! : there is in fact a correlation of +43 

Ween the iwo sets of factor measurements. 


Legibility and Preferences. For any given type face and any given person it is Possible, 
fi € methods described in the foregoing sections, to derive two distinct measures—(i) 
207 legibility or rather ease of reading (obtained by combining the assessments for speed and 

“suracy) and (ii) for aesthetic preference (obtained by converting ranks into standard 


«CE î 2 In these earlier researches evidence was obtained which indicated 
tt romande? and! USE па were in some degree dependent on temperamental differences. 
not gr this were also true of typographical preferences (and on this we had no direct information), it would 
diffe; glow that the typographical preferences were directly affected by temperamental differences: the 

ences, for example, might conceivably have influenced the choice of academic subjects. 
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measure). Thus for each separate fount we can compute a coefficient of correlation showing 
how far, if at all, these reactions tend to go together. For each of the roman types in our list 
the correlations were positive and (with one exception) significant: they ranged from 
-23 for Times New Roman to -47 for Modern 7. 


To secure further evidence on the significance of this relation, Mr. Cooper constructed a Sirenen 
test-chart with Modern 7 letters, and tested the visual acuity of each available reader at a distante S 
6 metres. The correlations for visual acuity were -68 with ease of reading and :36 with preferens A 
Eliminating the effects of differing visual acuity by the usual statistical procedure, we obtainec y 
partial correlation between ease of reading and preference amounting to :33—fully significant wit 
the number tested (53). Taken in conjunction with introspective evidence and other observation 1 
data, the result strongly supports the view that in practice legibility is not merely a matter of the size 
and shape of the black marks on the white paper or of the physiological efficiency of the eye : there 
is also what may be called (in the fashionable phrase) a * psychosomatic ’ influence at work : be: 
matter seems more legible, and reading becomes more accurate and quick, when the material is S 
in a type which the reader, perhaps without realizing it, finds aesthetically pleasing. The reader, 
of course, rationalizing his impressions, puts it the other way round. 


Supplementary Factors. As we have seen, the remaining factors group the type-f: aces into families 
and indicate how far their special peculiarities harmonize with one another, and influence, 1n Yo ed 
degrees, the preferences of individual readers. It would seem that, just as the style of type cv ЗА о 
by different nations ог by different epochs reflects their cultural characteristics, so the preferenc ore 
our readers are themselves related, often in a somewhat complex and indirect fashion, to their a 
specific interests and cultural attitudes. Such influences are particularly obvious in the case Ge 
outstanding individuals who in the past have sought to make a contribution to the theory or pra 
of typography. How far it is true of ordinary individuals further research alone can prove. 


8 to 
Implications for Psychology and Education. It would scarcely be appropriate here 


enlarge on the corollaries that might be drawn for the psychologist or for the educationis 
and teacher. Two points, however, deserve brief mention. On comparing the те ae 
obtained in our more recent experiments with those obtained by one of us some thirty кеп 
ago, we have been struck by the marked changes in typographic preferences that have ne 
place during the intervening period ; and this has brought home to us the need, and the UN he 
bility, of extending such investigations in the temporal or time-dimension by using what a 
factorist would probably call * O-technique’. Indeed, a psychological study of the gra 


development of taste and style during the four hundred years that have elapsed from ;ve 


: i 
time of Gutenberg and Jenson to the present day might well provide a simple and instrue" of ， 


contribution to what has been called, perhaps a little ambitiously, the * psycholo£ 


history’. inent 

At the same time we venture to suggest that those who hold that “ a more рг ing 
placein the curricula for boys and girls should be assigned to aesthetic subjects and deve jude 
a capacity for the appreciation of art and industrial design " ! might legitimately inc the 
in their efforts the appreciation of good craftsmanship in typography, in printing, an iful 2 
production of books. “ A good piece of lettering” (it has been said) “ is as beau sO 
thing to see as any sculpture or painted picture.” 2 And the work we have observed jon? 
of the Т.С.С. Schools of Arts and Crafts shows what can be achieved in this dire? 


IV. SUMMARY AND CONCLUSIONS 


1. By means of tests of accurac: ions Of ster- 

ans. y and speed, supplemented by observati eter 

movements, blinking, and other symptoms of eye-strain, an attempt has been made to doer 

mine the relative legibility of different styles of printing. The chief novelties in the jicatio” 
севе ets inclusion of newer type-faces issued during recent years and the apps. 

€ crucial tests after a period of preliminary reading to permit fatigue or adaptat! pem 


2. It was found that the different ch. isti i i e, Wi ther 
: 1 aracteristics—size, design, boldness of tyP® athe’ 
measure, of margins, and of interlinear spacing—all influence and interact with €a 


111. 
1 Report of the Consultative Commi , ‚ Education, P* 
2 ие S р on e era the Board of Education on Secondary Edit сш? 
„ 3 It máy be added that the first edition of Burt's Cause 18 Jcwardness 
printed, and bound by boys in one of the schools of the National Children's Home nS 
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SO that assessments obtained by varying just one characteristic in isolation may at times 
be highly misleading. Revised norms for reading books have been computed for children 
of different ages, together with an indication of the most suitable measures and leading. 
On the whole, Old Style Antique appeared most appropriate for children under 12, and 
Imprint, Plantin, or Times New Roman for those over 12. With adult readers enjoying 
normal vision wide variations in design, size, or measure seemed permissible without greatly 
affecting efficiency of reading. Type having an x-height of about 0-060 in. (e.g. 10-point 
Times New Roman or 11-point Imprint or Modern 7), with 2-point leading (or 1-point 
With type small on its body) and a measure of 20 to 33 ems, proved to be most satisfactory 
for general purposes. Slightly wider measures and somewhat narrower margins seemed 
preferable for technical as contrasted with literary or popular publications. 


f 3. The application of the methods already used by psychologists in research on other 
ris of artistic appreciation—ranking of items and a factorial study of the results— 
[ийе an independent classification of type-faces similar to what may be called the historical 
e assification, but differing suggestively in minor details. This psychological classification of 
ype faces implies a corresponding classification of persons according to their distinctive 
preferences ; and the factor measurements showed significant correlations with the type of 
aesthetic appreciation found with other test-material. Partial correlations indicate that 
Apparent legibility may itself be strongly influenced by half-unconscious preferences. 


. ,*. The introspective data obtained during our experiments on typographical preferences 
disclose a highly complex motivation—the customary reading and the cultural interests 
of the reader playing an unexpectedly important role. The reasons offered by our subjects 
In explaining their likes or dislikes reveal much the same motivational types as have been 
noted by Bullough and others in their work on the appreciation of colours, pictures, and 
musical chords ; and the content of the responses often showed a close resemblance to those 
Obtained from projective tests with inkblots or geometrical designs. 


5. The conclusions reached in this preliminary report must be regarded as provisional 
Only. The study of preferences for different styles of printing appears to be an unduly 
neglected branch of the psychology of industrial design, possessing considerable practical 
Importance for the printer and advertiser and much theoretical interest for the psychologist. 

Umerous incidental problems were encountered in the course of our experiments that call 


‘Urgently for more intensive research. 


APPENDIX 


THE CHARACTERISTICS OF THE COMMONER TYPE FACES: READERS’ 
COMMENTS AND HISTORICAL NOTES 


р Modern Developments in Printing. As reported in our previous issue, the Council of the British 

Sychological Society has decided to refer the choice of type-faces for its various publications to the 
New Publications Committee. If, however, such discussions are to be fruitful, it is plainly essential 
that those concerned should know what type faces are now available and what are their specific 
characteristics,1 We know of no short publication giving this information in a compact and non- 


1 There is, a prior issue. Many members of the Society hold that, in order to preserve 
Fontinuity， sda ot conge, E in the type or format of any journal. Others believe that there should 
‘ It will be observed that on both these issues 


Опе type д a ications of the same society. 0 
ve do ОО ae We may note that, in response to our appeal for the views of readers 


Ourselves tak, ite view. eT n ч Р 
dd members. E m that (if we may quote one correspondent) ' in spite of the striking 
“ances made in British printing during the last 20 or 30 years, most of the Society's journals are still 
Printed in the same type and style that they adopted before the first world war’. The medical journal, it 
ay be noted changed the type-face and the printed area in the middle of Vol XIX (1943), though the 
its 97 54105 that he desired to make the changes 10 years earlier. Psychometrika has quite recently changed 
We of printing i i í 
e d " ; the generosity of the Monotype Corporation, of Linotype, 
Spec! of HEE are particularly indebted 10 от Blake (Caslon Letter Foundry) for presenting us with 
dj çimen books specimen sheets, etc., usually available only to the trade. Examples showing the more 
usednctive letter-forms in the various type faces that we tieg WU bo aad 12] Br Басы LM 
Sed in hun А ose r. Josep 2]. ave also hank 
wil: avid going them is EON Бон to the Publications Department of University College) 
© Was good SEGUE Bue Pese pages in proof, and suggest several important corrections. 
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technical form. The same want has been felt in discussing the selection of type for children’s books 
and for verbal tests ; and several teachers, psychologists, and education officials have asked for some 
such synopsis. A detailed enumeration of distinctive peculiarities will have little interest for the 
ordinary reader ; but it is indispensable for our purpose, since (as we have frequently observed) an 
unsuitable eccentricity in one or two letters is apt to be overlooked until the type face has been chosen 
and the material actually set up. In citing examples, we have, so far as possible, tried to name 
books and periodicals which would be familiar to psychological and educational readers, and at the 
same time illustrate the use of the italic as well as the roman forms. Those who have taken part In 
our experiments and discussions have frequently inquired about the origins of the various designs, 
and of the names by which they are known : and consequently we have included a few brief notes on 
the sources of the commoner faces. Indeed, without some knowledge of their history, and of the 
conditions under which they were produced, it is difficult to appreciate the nature and appropriate 
functions of the different types. 

Origins. Printing is an art in which, for both historical and technical reasons, the personal 
element and the mechanical are blended as they are in no other. This double character, as may be 
seen from the introspections of our subjects, is largely responsible for the conflicting criteria to which 
they variously appeal, and for the unique and complex problems in aesthetic psychology which the 
study of printing presents. The production of books was the earliest industry to which methods of 
mechanical standardization and mass production were applied.! The idea of using movable types 
was apparently due to Dutch woodcutters. The special invention of Johann Gensfleisch (° Gooseflesh ， 
known as Gutenberg, from the village from which his family came) was the punching of metal matrices 
and the use of adjustable moulds in which movable types could be cast in large numbers. For the 
results to be intelligible to his readers the printer had necessarily to imitate the handwritten letters 
with which they were already familiar. Gutenberg (1454), like Caxton after him (1474), printed in 
* Gothic’. What today we call ‘roman type’ was evolved in Italy, between 1465 and 1470, from а 
curious combination of Roman inscriptional majuscules, dating from the period of Trajan's column 
(2nd century, д.р.), and а Renaissance revival of the Carlovingian minuscules.? As a result the 
shapes of our modern capitals are derived, not from the attractive curved uncials, developed by 
scribes during the fourth and fifth centuries, but from square geometrical forms incised on stone with 
а chisel; on the other hand, the smaller characters of our * lower case ’ still preserve much of the 
qualities of handwritten letters, notably the oblique thickening produced by the slit and slightly cante! 
nib of a reed or quill.3 


.. , А. Old Faces. The types commonly grouped under the phrase * Old Face’ are characterized 
(i) by light * colour’ (in the printer's sense), (ii) by comparatively slight differences between thic! 
and thin strokes, (iii) by bracketed and sloping serifs, (iv) by the oblique or * biased * shading which 15 
characteristic of pen work, and (v) by figures which do not range on the line.4 11 

„Аз noted above, those of our readers who obtained positive factor measurements for Bipolar | t 

(mainly readers in our ‘literary group’) preferred types belonging to this family. It was here tha 
the conflict between the two rival criteria became most evident. This is clearly discernible in some О 
the introspective comments quoted above. The following are still more pertinent. “ This style 
printing (Modern 7) seems lifeless and machine-made ; the others (Bembo and Caslon) look as though 
they had been drawn by a human hand.” ** Those types (Bodoni and Scotch Roman) look too geo 
metrical ; a book is a work of literature—of art ; and the type should have something of the organ 
grace of а freehand line.” With these contrast the criticisms offered by the other group. “ The 


1 Many of our older readers, especially those who had been influenced by the teaching of Morris ang ag 
p 


followers, were inclined to protest against what they supposed to be ‘ the gross deterioration in ‘ne: 
produced by the application of mechanical methods But the printing DIS itself essentially a machin; 
and such protests are five centuries too late. Shaw's secretary relates an instructive Story. s c 
wanted the type (for his collected edition) to beset up by hand : in the Morris tradition, he scorned 
2nd the usurping monotype.” The managing director of К. & R. Clark arrived with two specimen P: (as 
an d him to pick that which he thought the best. Shaw unhesitatingly chose the page which 
it fune .Out) had been set by monotype! So far from debasing the quality of book production. has 
uuo ER К mechanical type-setting has greatly enhanced її: and there can be little doubt that what t 
авот СО € revival of printing’ and the present prestige of British typography have been largely а РГО cut- 
Ed E TEE policy of the Monotype Corporation, who, 50 years ago, embarked on a progressive 1® 
ing тона type fices, supplemented by new designs by artists like Bruce Rogers and Eric Gill [23] ork 
was g ne A of learning initiated by Charlemagne after the dark ages (c. 790), Alcuin of tory 
US Ge 1 € Abbey of St. Martin at Tours to superintend the copying of Church books. The 5 
Could for Sub The ио бип desired to write, declared that “a hand accustomed to the SW 
d akat onn implest shapes” ; hence the substitution of the plain half-uncial forms for the co 
t Tovingian hands. Being without ligatures, this simplified style lent itself admira 
the requirements of movable types when the time arrived. : ing 
? Those readers who wish to examine further the typography of the printers named in the follows 
paragraphs will find original specimens of the work in the cases in the King’s Library of the British Muse 
which illustrate the History of Printing. È 


+ Some ‘ old faces are now supplied with an alternative set of figures ranging on the line. 
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ait of type (Garamond and Fournier) may have been quite all right when people wore picturesque 
ei оез and lived in Renaissance palaces : in a technological age you want a plain technological type.” 
to Los kind of lettering (Baskerville) sets me wondering who drew those pretty curves, when I ought 
о m thinking who wrote this passage and what did he mean." “ Modern printing should be like 
c odern building, purely mechanical and functional, not a phoney revival like late Victorian Gothic 
T pseudo-Tudor house-fronts." 

It will be remembered that the factorial analysis of the specimens in our series subdivided the 

Faces into two main sub-groups, called for purposes of reference * Continental’ and * British ° ; 
and the former in turn was re-divided into what we shall call * Italian * and * French * respectively. 


1. Continental, The most distinctive features of the * Continental ' subgroup are the crossed W 
and the splayed M. Both these features were criticized by teachers and others who thought them 
CE eccentric ” for children’s books or “too archaic” for scientific books. _‘* The legs of the M seem 
let € slipping apart", says one. “Nowadays”, says another, ** double-U is no longer a double 

ter, and V no longer stands for О.” Others, however, thought these minor variations “ add life 


and interest to the printing ". 

fro (a) Italian? There is no single criterion distinguishing all the specimens in this subdivision 
us m the next. To judge by the comments of our readers, the common characteristic is largely 
iu ene; namely, a freedom from the decorative tendencies that are so marked in the other type- 
cis our continental series, particularly the French. The nearest approach to a positive diagnostic 
re 3 Is the curve of the v and w in the italic founts—neither angular nor rounded, but (to quote one 
"s er) a bit nouvel art”. However, so far as it can be defined, what seems to have led most of our 
t aders to rank all three specimens together at much the same level, either high or low according 
9 their taste, is an impressionistic quality difficult to describe, but summed up by one of them (a 


little Sarcastically) as ** a chaste art-and-craft appearance a А , 

th Bembo (270).2 This type, a comparatively recent revival, is perhaps the best representative of 

Da early classical design from which all our later roman types are descended. It won high praise 

EM both our ‘literary’ and our ‘scientific’ groups. A tiny point which raised it (with Plantin) 

Whe all the other * Franco-Italian ’ types was the lower case j, with its generous curve and bulb, 
ich prevents any confusion with і. The x-height is small, to leave room for the long ascenders and 


descend, : н ionall iticized : 
ers; ind ers are taller than the capitals—a feature occasionally criticized : 
( Sede te eae р letters in * The’ or ‘ When” suggest 


5 One reader remarked, “ the uneven heights of the first two hen. { 
ot the printer had unintentionally picked then from different founts "). As to the italic (which 
mes from a different source, see p. 54) opinions were divided. Teachers who were members or 
ег геге of the * Society for Italic Handwriting ’ praised the oblique serifs and the EID eirca 
with vals in the lower case, and thought the use of this type face in reading books E t be c € 
теа à revival of the so-called Italic style for children’s script. On the other hand, Ron dea ei 
ders Often considered these features “ rather mannered ". The italic y (* more like a Gree 


ma with a serif to its tail ") was almost universally condemned. И 
and œ nese (59). This was xa of the earliest of the artistic designs cut by the Monotype Corporation, 
was brought to them in 1911 by J. M. Dent (publisher of the * Temple Classics ' and * Everyman's 


1 m, c. 1454) were based 
architecture) which 


gu 
on a | 2 k 
hadhtdsgenerate Vernacular ( Burgundian Bastardas '—full of sharp points like Grune PUES en 


a Come current North of the Alps an 


ivi i i wned at f 
ich {ив out tha English. Charles Cres When Gutenberg’s foundry was burnt in the 
ubiaco near Rome, and there installed a printing press, the 
Scholastica, later fans оде p Maini 

; ved on to Venice . Since the books 
hem; but he eventually mo tiqua or humanistica then in favour 


ittera ani A 
patari o Anope iho ote manuscripts, and are the most highly 
On his death his material passed to the 


the prii у pe i Aldus Manutius printed 
I printers at Rome. | . with material from Jenson, Aldus Manutius printe 
а тац trae soon after setting up his press 2 J Venetian poet Pietro Bembo, Du on A leading humanists, 
GOs for hic c Aetna, written by ће 3 25 53 later Cardinal and Secretary to Leo X. The type, cut by 
shao of Bolo revival of Cicerone from the designs of Jenson : the e’s an E ave their moe amiliar 
таре; the à gna, differed somewha d their slab serifs abolished. Aldus Rem ана to have iscarded 
P type ake apitals are narrower, ар t another version, cut for his still more celebrated Hypnerotomachia 
iphij; most at once in favour of уе! hed the basis 


i for Garamond's French types, and so ultimately 
for Сас] d was this that apparently furnis 


y and his successors. R “Bembo for ps 
Min hose who wi: ji itability of Bembo ine’ 
Ryles Concept of o judea е iison 1949) and Valentine 


ychological work will find good irstances of 
s Parents and Children (Methuen, 1953). 
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Library") It was modelled, partly no doubt as a result of William Morris's influence, on a is 
heavy Venetian 15th century original.| As several of our readers observed, it is suited only to a fee 
stricted range of publications. Many teachers, particularly headmistresses, thought it excellent A 
children’s poetry books " or * for Shakespeare". The points that most frequently aroused comme ) 
were the slab serifs of the M and N, the sloping stroke of the e (both reversions to Jenson's denet 
and the thickening of the horizontal strokes іп the z. The italic is a modified version of that associa m 
with the Italian and French type-faces, rounded, expanded, and slightly thickened to secure grea 
legibility. | ыйа 
Centaur (252). This type-face was originally designed by the American typographer, аг 
Rogers, for Houghton Міћіп.2 The roman fount is modelled on Jenson's roman, with ШИШЕ 
serifs for the lower case, but without Jenson's slab serifs for the capitals. The most e ath 5 
features are the sloping cross-stroke for the lower case c, the incurved shank of the italic ht ра 
reversion to Jenson's designs), the absence of thickening in the tail of the j, and the straight 1 m 
stroke of the у, thickened at the lower end. In the re-drawing, the type seems to have lost mue i 
the robustness of Jenson ; and was even criticized by some of our readers as '* self-conscious D. 
"art-y ”.3 The italic, designed by Frederick Warde, is modelled on that of Arrighi (see p. 54)- 


inted 

(b) French.* The two peculiarities most frequently criticized in this group were the Pe 
bases of italic v and w and the swash descending tails of the italic 2: most of our readers thoug hts 
letters highly unsuitable for algebraic symbols, and many disliked them even for general use: athe 

Garamond (156). The roman resembles the Venetian characters of Aldo, but the j presor ents 
old form, that of an elongated terminali : the similarity caused a good deal of unfavourable eor an 
But the chief target for criticism was the italic— sometimes too narrow, sometimes too wer type: 
the capitals sloping at inconsistent angles ”, reflecting “ an unskilled stage in the development or the 
design". (Compare the slanting О and 5 with the nearly upright R, the splayed A, V, and x v sharp: 
narrow 5, B, and у.) The О, “ shaped like a written Z”, the curly £, the long-tailed с, the sped 
based v and w, the g with its wide lower bowl, the y with its back-swept tail, and the primitive Noor g 
h, “ almost like a b”, were disliked by тапу. А few warmly praised these peculiarities, as 5 
the type a marked character of its own ".5 pular 

Granjon (Linotype).5 We included this because we were told that it was one of the most po oint. 
linotype faces. The design has many similarities with the preceding : e.g., the j tapering to а 1055 
It differs chiefly in using the same form for the capital J (almost a straight line), avoiding the robably 
W, and reverting for the italic v and w to the form used in Bembo (the last two features р down" 
account for its being ranked above Garamond). Its most distinctive peculiarity is the long 
ward tail to the italic k—criticized by all who disliked any tendency to flourish. 


rt 

є e 5 
„ 1A similar type, based mainly on Jenson's roman, had been designed by Emery Walker for A or 55 
lived but celebrated Doves Press (1901) at Hammersmith. It should be noted that much of Willian ght) 


ington $ 
3 The psychologist will find an interesting example of this type—italic as well as reman—in Sherrin® 
Man on his Nature (Cambridge University Press, 1940). during jor 
* The essential characteristics of what are now called * old faces" became fairly well fixed 185101. n 
later years of Francois I—' the golden age of French typography '—largely as a result of his Pre of t 


the 


1 3 cutters 0! in 
Italian art and culture. Claude Garamond (c. 1535), the most successful of the French уре сие ҮП 
period, produced what eventially became the standard European type outside Germany. It see co t 


that copies of Aldus's work reached Garamond in Paris; and, as we have seen, it was ОП t 

Garamond's designs were based. g jjved U 
, In the 17th century, after a temporary decline, French printing, like French scholarship. Ie ik n t0 

Richelieu; and the Imprimerie Royale, established by him in 1640, started with what were toto 

Garamond types. Thus, the style that had been introduced a century earlier continued to domini, centu 
printing, with minor variations, until it was superseded by a‘ modern face’ at the end of the ] s tO “irei 
А ' Garamond ’ type was recut in France in 1898, and it was one of the first of the classical faces а Ше 
by Monotype. Mrs. Beatrice Warde (* Paul Beaujon °`), however, has shown that the punche put W? 
by the Imprimerie Royale in the 17th century were not, as they supposed, those cut by Garamond: alê 
in fact designed, nearly a hundred years later, by Jean Jannon of Sedan in 1621 [8]. nfortul in 
Professor Saw's Leibniz (Penguin series, 1954) is set іп Garamond. It was, we think, ап ks pe 

type to choose for Kenyon’s The Bible and Archaeology, where the juxtaposition of the italic 8 8 ce 
Egypt and archaeology) requires a very ugly logotype. ' prod! te 
5 Granjon was printer to Henri II (c. 1557) and afterwards to Pope Gregory XIII. The tyne er haps yal 
by G. W. Jones for Linotype, was based on 16th century French type faces. Mention should Ру. кору 


мей“. озі! 
made of another fount recently cut by the same desi 5 after a celebrate oticea nd 
Printer ' of the same period). ‘It has e olds esigner, and called Estienne (after a 3 n a 


; fashioned itali i а d, but it is chief? лде 
for its long ascenders and descenders—a fe: lic seen in Garamon s. Esti 


ature revived by Eric Gill in his ‘ Perpetua ， 
several other faces we were forced eventually to exclude for fear of overloading our tests. 
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Fournier (185).1 Several readers were puzzled by the serif to the foot of the roman b (seen also 
in Bodoni) ; and the lower case j (with no blob) was again the subject of criticism. Many also thought 
the roman fount too condensed, especially the M and Z : (the 12-point lower case alphabet has a length 
of only 130 points, i.c. 1-8 in., as compared with Garamond which has 144 points, i.e. 2in.). The italic 
evoked the most opposite views. Unlike the preceding founts, instead of providing a new set of designs, 
itis to a large extent merely a sloping version of the roman : thus the m and л have sharply sloping 
Serifs instead of curved initial hooks, and the upper right arm of the k has no loop. Several considered 
the inclination excessive (19° instead of the Garamond 12°); and the oblique bowl of the g and long 
downward tail of the z were frequently condemned as " irritating " or * trying too obviously to 

€ quaint”, 

Plantin (110).2 This type face, designed in 1913 by the Monotype Corporation, was the first 
Produced for use on coated paper. Its blackness on uncoated paper has made it popular for child- 
Ten's books. The italic, with the well rounded base to v and w, departs widely from the French 
designs, and was thought to be well adapted for mathematical use. Nevertheless, rather to our 
Surprise, it was far from popular with our scientific readers—chiefly because of the roman j (^ too 
like i 7), the crossed W's, and the italic J and О. One or two noted the gap in the P between the 
Upright and the end of the curve, and asked if it was deliberate or accidental; and several disliked 
the flat top to the capital A. s и 

_Ehrhardt.4 The roman j tapers to a point; both arms of the C have serifs; the М is splayed ; 
the italic J has an unpleasantly low Баг; and in both founts the Q has an elongated tail—features 
often Prompting unfavourable comment. Several also disliked the somewhat irregular bowls of the 
Toman and italic g's. The characteristics most frequently condemned, however, were the peculiar 
italic w (joined head-strokes, pointed bases, while that of the vis round) and the even line of the zero— 

more like a circle than a numeral”. This was the least popular of the old faces. donc 

2. British. England, unlike Germany and Italy, had no national pride in printing, and readily 
borrowed from the ‘other side of the Channel. Caxton's first book (The Game and Playe of Chess, 
1475) was printed at Bruges in an unattractive Gothic style ; and one of his later types (based on the 
formal liturgical hands—rextura 5) became the basis of the so-called * English black letter’. Shortly 
after the establishment of the Royal Society, Dr. Fell,ó Bishop of Oxford, imported punches from 

olland, which then had almost а monopoly of type-founding, and with these (which still exist) set 
Up the Oxford Foundry. The first competent engraver of types in England was William Caslon, 
Whose foundry has survived to the present day. 
by к а Сасин, терып ine р is gro} 
veral obvious points from the French and Italian. ү $ 
ave a wide set. The W'sare uncrossed, and the M’sunsplayed : the roman Q (except in recent forms) 
as à swirled tail. The j has a well rounded terminal ending in a bulb. The italic comes closer to 
€ English running hand, and is somewhat condensed. The Dutch origin is traceable in the high 
ar and swirled head to the italic J, and in the curved serifs of the rounded italic v and w. а 

Caslon. Caslon started a new foundry in 1720, modelling his letters on those of van Dijck, 

but greatly improving their delicacy. The result was a distinctly English face, which (except for a 


up by our factorial results are distinguished 
The roman letters, particularly the capitals, 


! The Fourniers managed the Le Bé factory in the 18th century. Р. S. Fournier ( le jeune °) proposed 
the * Point’ system in his ‘Manuel (1764), and started modifying the old French designs in the direction of the 


gout hollandais. — , : : 2r 
C ertrand Russell's /nquiry into Meaning and Truth (Allen and Unwin, 1940) and Murdo Mackenzie's 
9ntrast Psychology (Allen and Unwin, 1952) are set up in 12-point Fournier type. г саш 
2 Towards the close of the 16th century, superiority in printing, as in other forms of sc holarship, 
Passed for a time from Roman Catholic France to Protestant Holland. Christopher Plantin (1514-89), 
рот near Tours, established his press at Antwerp, and shares with Christoffel van Dijck (ty pe-cutter for the 
lzevirs) the credit of producing some of the finest Dutch work. Their designs were sturdier versions of 
se produced by Garamond and Granjon. There is an Interna Plantin with more irregular italic. 
either version is | j t касі | duction of the original. а е 
is by any means an exact reprodu Ог Е va 
«It stands up well io the demands of high speed rotary printing. The most manat ats of its use 
Н Ше Listener. For the psychologist an instructive example is the У ири сіл ЕЦ the 
erd (Oxford University Pi 1953), which shows the blackness oi th ипсоа арег. a 
by hi he E ue. suggested by that used by Ehrhardt of Leipzig (c. 1720) and derived 
m from of Anton Janson. ч uk f 
“te e e E a large upright hand, wiih E angles ваб по curves, Having: а woven 
Xlure* : (Cf. * Fair as a text B in a copy book, Love's Labour's Lost, V, il, A y th t ry it ha 
Tiperseded е дета с елер та Г School (11-12th century, a blend of Anglo-Irish with Caroli; ian). 
the Authorized Version (1611) was printed in textura. Caxton himself explained that his books were 
t 


« NOt for a rude vplondyssh man to rede, but onely for a clerke and вепіуітап ”, and he therefore aimed at 


a mean ji RA " ~ (Prologue to Eneydos). se М á 
b: s ا ت ی ل و ا‎ from he doggerel parody of Martial's 33rd Epigram, written 
У Tom Brown as an Wnderpraduate, after Fell (then Dean of Christ Church) had expelled him. Besides 
ieMon's press in the precingts of Westminster Abbey, presses had existed at Oxford and St. Alban’s from 
ae 15th Century. Of the several early foreign printers who set up presses in London, Richard Pynson, 
lerwargs printer to Henry VIII, used a * white letter * to print an Oratio for the papal nuncio (1509). 
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spell of neglect in the 19th century) has remained in continuous use for over 200 уеагѕ.1 Perhaps 
because the Declaration of Independence was printed in Caslon, it has been still more popular in the 
U.S.A. The minor irregularities of shape and weight give it, as our readers put it, a “ homely, 
unassuming, old world look " ; and led others to condemn it as ** unskilled " or ** primitive ”. 

In the roman the distinctive features are the wide H, M and U—the H indeed being frequently 
criticized as“ disproportionately broad”. Several disliked the obliquely flattened top to the A, 
and asked if the type was not slightly battered. The irregularities are most marked in the italic. 
In both Caslon and Baskerville the italic Y resembles a Greek upsilon (a shape which excited repeate! 
criticism) ; the lower case w has an open loop ; and the C in both roman and italic has a well marke 
serif to the lower arm (sometimes omitted in smaller sizes of Caslon roman). With the Caslon italic 
the downstrokes to the angular letters, A, V, and Ware parallel to those of the square letters, Н, M, 1, 
К, etc., so that the former seem to be toppling forward. As a result, the gap between the capita 
inital and the rest of the letters sometimes breaks the word in two—a point seized on by most of the 

eacners,- Р 

Baskerville. Тһе most conspicuous peculiarity of Baskerville is the pointed unserifed junction 
of the two inner strokes of the large and small W—a feature commended by many of our readers, 
who argued that “in English W is a single sound, not two consonantal V's". In the g the lower bow 
is not completely closed. Апа practically all the letters display an increased contrast between the 
thin lines and the thick. i 

Owing to a free use of kerned letters the italic has an individuality of its own. The terminals 
are rounded, and the thickening has shifted from the bases to the down strokes—characteristics 
suggestive of elegant penwork. Compared with the roman, the italic shows even greater condensa- 
tion than Garamond, and was consequently criticized as “ crowded ”. Considering the popularity 
of this type with many publishers, the criticisms we received were unexpectedly frequent. Some 
thought the general effect “ a little too genteel”. The decorative tendencies of the italic excited both 
praise and blame. The ** pothooks and hangers " of the italic m and л and the upstroke to the italic 
р were said to be “ suggestive of copperplate engraving " ; the curves of the k (as in Fournier devol 
of a loop) and of the z were often considered “ fanciful ", and the swash italic K, N, T, and Z, an 
the generous tail of the Q’s (roman as well as italic) “ much too distracting ".4 On the other hanc; 
several readers placed this type at the top of their list; and many picked out the very characteristic? 
ter others so strongly criticized as '* making the type far more interesting than ordinary humdrum 
print”. 

. Old Style. The design that originally bore this name (cut in 1860 by Phemester for a Wi e 
Victorian firm, Miller and Richard) sought “ to supply a type with old face qualities without а 
archaic features of Caslon ”.6 It was so successful that it was quickly copied by most other оше 
in this country and by printers in Germany, Scandinavia, and the U.S.A. Thus, by the Бевіппіпе 
of the twentieth century the majority of books were printed either in a variant of Old Style ОГ e 
one of the * Moderns’: and probably these publications moulded the notions of our older reade $ 
as to what the proper shapes for English letters should be. There are numerous minor variation? 
of the * Old Style’ face: e.g., Monotype and Intertype have an A with a pointed apex, while Lino 
type, like the earlier founts for handsetting, usually has a flat top. We used the Monotype version. 
(Series 2)—one of their “ early bread and butter founts " (as they call them) cut in 1901. The Sere 
on the upper limb of C and G has a well marked slope, and the G and b have spurs， The absent, 
караа RA xe even colour vae Praised by most of our readers. Comparatively few Spe 
p үеге criticized ; but a number of our younger li general appe? 

rather insipid”, or “ harmless but lifeless”? و‎ dereio Eghte Е 


ell known 


haw. shag 
to be set in Casl The pay now 
e set in Caslon long primer unleaded—'' no rules, no flower i 1 Е tati 

1 s, no heading r Cross , ation 
many have been repelled by the solid grey oblong pages of his prefaces, E HET single indentatio' 


3 John Baskerville (b. 1706) was a writin i i s 
Ss еә; OS g-master at Birmingham. Afi eight years idge: 
ffaigning a rod suitable for ** books of consequence”, he Beckie printer te ihe DNE) o amb Eta 
Pur атоо woverpaner and novel printing techniques made it possible to produce a far clearer ant ре”. 
; and, as Macaulay puts it, his Virgil “ went forth to astonish all the librarians of Eu пош! 


' 
5 

son 
ergus ind 


1 
For examples of Baskerville, see Ayer's Language, Truth and Logic (Gollancz, 1946) and ETE ly 
ceding” 


hiswick Press for a fictitious 17th century Diary- h 
print Tupper’s Proverbial Philosophy and lorrisv 

the century Ruskin, Emery Walker, William Мотел 

ographical renaissince ” that eventually made “mo 

See above, p. 48, footnote 1. 


Charles Ricketts initiated “ an aestheti 
printing the soundest in the world ” DL S 


50 


CYRIL Burt, W. F. COOPER, and J. L. Martin 


Imprint. This was “ an all purpose design " cut by the Monotype Corporation in 1912 to the 
Specification of G. T. Meynell and J. Н. Masson for a new monthly review called The Imprint, It 
Proved to be one of the most popular faces in our series. The general style is that of Caslon ; but 
the type is larger on its body, and gets rid of the peculiarities that strike the modern eye as primitive— 
€.£., the marked and irregular slope of Caslon italic, the serif to the lower arm of C, and the curved 
italic У. The roundness of the lower case italic resembles that of Baskerville, but the central loop 
to the italic w is not visibly open. The one feature most frequently criticized was the tiny tail 
to the roman Qu М 

Times New Roman. The Times newspaper, originally set in a rather crude old face, changed 
early to modern (1799), and kept it for over a century. On October 3, 1932, the paper arrived at the 
breakfast table in an entirely new type, specially designed by Mr. Stanley Morison (Reader in Biblio- 
graphy in the University of Cambridge and Typographical Adviser to the Cambridge University 
Press and the Monotype Corporation). А year later the design was released for general use. At the 
Present day it is probably more widely used than any other face. For its original purpose. to 
Combine legibility with economy of space, it is admirably adapted—large on its body, compressed 
in spite of its x-height, yet with well open counters, and furnished in its original form with short 
Space-saving ascenders and descenders. We found it amazingly legible in the smaller sizes. 

Its more obvious characteristics classify it with the * old ° faces. Тһе serifs to the capitals are 
bracketed ; while those of the lower case are wedge-shaped ; the shading of the lower case roman is 
Strongly biased (indeed, the tops of the curves in b and p and the bottoms in c, e and q were sometimes 
Criticized as “ disproportionately thick”). Cand G are clearly distinguished; the J is non-descending; 
the Q has a small central claw tail (perhaps leaving the letter “ too much like an О”). The italic 
is only slightly sloped (16°), and tends to repeat the design of the roman: e.g... the i, j, т, n, and p 
Commence with straight serifs, and the v and w have angular bases. Mathematical readers objected 
that some of the italic was unsuitable for algebraic equations, in particular the v, which is indis- 
paBuishable from a Greek ли. Оп the whole, however, this face was one of the very few that were 

Pular with both the literary and the scientific groups.? А Р i 
Perpetua (239).3 Нес of its exceptional interest, we included this type face in our first and 
larger series ; but, since it has rarely been used as a book face,4 we dropped it in our final experiments. 
: 1€ roman /ooks an extremely legible face ; but many readers found that after a while it proved rather 
tiring. The italic aroused the most conflicting judgments, even among our literary group. There 
Was, however, a general agreement that, for text matter as distinct from a title page, its peculiarities 
lend to thrust themselves between the author and the reader. Accordingly, in view of its rather dis- 
tracting quality and the unusual form of the numerals, we had little hesitation in deciding that it 


as qui i ica ications. 
Quite unsuitable for mathematical publicati tics of the so-called modern faces are (i) the 


‚В. Modern Faces The distinctive characteris sp 4 piena 
Vertical instead of biased shading, (ii) the marked contrast between the thickness of the ‘ down strokes 


! For exa i e reader may take McDougall’s Psychology (Oxford University Press, 
Д Originally oor оГ трі тар 1912) or Burt's SW MIO University Press, 1935). 
his Journal z it. J. Med. Psychol. are set in Times New Roman. 
S Реге PONE TRE E CR martyr) first appeared in an exhibition of types SEED ab the 
the Bride Institute in 1930; but it seems to have attracted itne interest from ue Бур есет ашан а 
War. It w drawii ade by Eric Gill, лог (so he says) with specia T И, = B 
cay; Oman follows the ete of ihe best chiselled pO E кейе ысы 
Ing as an int iary sta The narrow E and S, the slightly splay: » о Я 
the thrus aiene tiary SIRES i ally. seem manifestly copied from the cele- 
t-out leg , d the proportions generally. seem m Uy ¢ 3 ‹ 
brated Bane Mop or the E and n U. Рае which does not exist іп remi is uncial. КЕЖЕ, 
oll declares that the designs preserve. that commonplaceness and normali ТЫЫ, ите the idiosymn- 
0 ae alia Te miel lace nor normal. п yün 
a neither commonp! A M "e 
MUI eet К The stress is usually vertical : the serif are hairline but bracketed, 
izontal in th A Be in theitalic; the O is circular ; the E equal-armed j (10 Pon саде 
the to e roman, oblique in Drizontal: the upper stroke of the 5 unusually long. The italic 


is а trokes, and the U has both its verticals 
па roman. The В, D, R, and Р e ОНТО, turns up and joins the bowl, so 
ned. But the most striking pecu 


at the letter г б > 
esembles an inverted B. i . Selected Poems, by T. S. Eliot, 1952) ; 
and ult has recently been used for the text of several Penguins (e.g. Sele ^ 


the з > "lerim. has been adopted for the current issue of The Yearbook 
of Educating (ent linotype versions сапа eder the style will probably be most familiar from its use 
T headlines in ans, 5). d dvertisements catering for educated or pon rues rea Es (e.g. those 
a the Arts Council H.M. Stationery Office, and even the wrappings of household commodities intended 

T the “mor ‚Н.М. Sta d 
$35 уге refined classes ’). ment among our testees that Perpetua was “ the 
mos delfe! ing and display DE. bi gepen AEpe-faces suggested for the cover of this Journal it 

Cei 2 

Ed by far the largest number of voles. opular in France at the close of the 18th century when 
Classical” 50 galled “modern” faces Богато ре at Paris. The alleged model was the rigid sculptured 
26, голаву агі became ie Gent inscriptions, many of them qewiy idiscoyereds hich were 
ed aesthetically. Ead e freer and more flowing characters traced by the monastic quill. The 
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(i.e., those that are vertical or run down from left to right) and the thinness of the curved strokes and 
the * upstrokes ° (i.e., those that run up from left to right), (iii) the fine horizontal unbracketed serifs 
(iv) the figures ranging on the line, and (v) generally, the geometrical and rigidly mechanical structure 
of the design. 

As we have noted in our studies of other forms of art, there seems to be a kind of * temperament " 
(if we may use such a word to designate what is quite as much a cultural, epochal, and often national 
attitude as an effect of innate constitution) which tends to admire geometrical, symmetrical, and 
perpendicular qualities in human art and to favour a scientific or technological design as contrasted 
with what is imaginative or emotional or gives free rein to fancy—in short, a classical taste rather 
than a romantic. During periods in which one or other of these tendencies becomes dominant in 
sculpture, architecture and literature, it generally appears, after a little delay, in commercial and 
domestic art, and ultimately in typography. The correlation between the various manifestations was 
obvious not only in the results of our tests, but also in the introspections. People who showed the 
preferences of the * stable introvert ° in other forms of aesthetic appreciation (cf. 18, p. 297) seemed 
also to reveal them in their preferences for type faces ; and frequently they used the same arguments 
to justify both forms. 

Here, as before, the factorial results subdivided the various type faces belonging to this broad 
class into two subgroups, the Continental and the British. 


1. Continental. Та the * Continental’ subgroup the characteristics distinctive of the * modern ' 
face appear most clearly. The * British ’ as usual show an inclination to compromise. А 

Bodoni (135).1 Morris vehemently objected to “* the sweltering hideousness of the Bodoni letter, 
the most illegible ever cut, with its preposterous thicks and thins”. Our readers generally ranked it 
last of all. Numerous incongruities were noted. Since the alphabet includes letters, like У, W, 
X, and Y, which are composed solely or mainly of symmetrically arranged oblique strokes, it 15 
impossible to apply the principle of vertical thickening with complete consistency. Moreover, In 
the upper case the inner limbs of the W cross, in the lower they meet in a point, as in Baskerville. 
The C has a serif to the lower limb, like Baskerville ; the b a foot serif, like Fournier. The top of the 
tis cut off flat. The italic is only slightly sloped (10°): some letters begin with a horizontal sert 
like printed roman ; the v and w (but not the и) with a rounded curve like a running script. The 
right hand upper limb of the italic у is curved outwards ; and in both roman and italic the capital J 
falls only a little below the line—" as though the printer could not make up his тіпа”. The square- 
ness of certain letters, the correct symmetry of others, the parallelism and rigidity of the lines, led 
many readers to complain of the * geometrical ' design : “ it might almost have been drawn on square! 
paper with a ruler and compasses, and then carefully shaded in." On the other hand, several of our 
subjects observed that, owing to its emphatic character, it seemed highly suited (in the larger sizes) 
for commercial advertising or headlines in newspapers. 


result was a somewhat stiff and pompous style, like that of academic painting and sculpture during the years 
when Louis David was director of arts. _It should be noted that an upright style of writing existed long ago 
in this country (e.g., that used for the Lindisfarne Gospels, an insular adaptation of the half uncial hand). 
The alphabet then in vogue contained no v, w, or y: with these letters, in order to avoid making bot 
downstrokes thick, the method of writing has to be changed ; and so an obvious incongruity arises. 

Many teachers who favour an upright style of writing require their pupils to hold the pen at right- 
angles to the ruled or imaginary base line on which they write. The upright characters of the carly 
Pause Piss however, were obtained, not by changing the position of the hand, but' by cutting the n! 
obliquely. 

_ The common notion that the * modern ' style was first introduced by Bodoni and then taken up by the 
Didots is quite mistaken. Similar characteristics are traceable in several Italian manuscripts of the late 
renaissance. Italian architects and painters, and later the French savants of the Académie des Sciences 
under Louis XIV, had constantly interested themselves in inscribing letters in squares, triangles, and circles 
according to definite proportions, and in attempting to formulate artistic taste in terms of mathematical 
principles. As we ourselves discovered, it was a popular hobby with several male teachers and craftsmen 
who took our tests : see, for example, the * general remark ’ made by one of our readers, and quoted above 
ander тайпа) ора (р. 40). Р - 

! Giambattista Bodoni (b. Piedmont, 1740) was head of the Stampa ale at Parma. The rathe 
dazzling brilliance of the contrast between the heavy main HN Stampa Reale t sae and possi 
the fact that such type required good paper and was itself a sign of an amazing technical precision, rendere! 
the new style popular for all serious work. Bodoni printed editions for Gray and Horace Walpole， who 
warmly applauded the * new continental workmanship’. He started with French models. Indeed, the 
thin horizontal unbracketed serifs that are so characteristic of the * modern face’, as well as the narrowing 
of the letters, are already descernible in type cut by Grandjean (1745), who had been ordered by Louis Y 
to produce a new design that should be the monopoly of the Louvre—the Romain du Roi. But it seems to 
have been the new delicacy and emphasis introduced by Baskerville which Bodoni and the Didots chiefly 
admired and endeavoured to magnify. The new Bodoni Press in Italy has recently produced remarkable 
examples of the typographer's skill. “In this country, however, it has become confined to titling, advertise” 
men and letter поет accompanying art books on ultra smooth paper. | d 

he versions оГ“ Bodoni’ cut by British and American foundries are more han the original, а? 
the heart shaped italic v designed by Bodoni СЕАТ open than the shai 
conforming DR the w: 8 у has been discarded (except by German printers) for à 
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Didot (520).1 This type face is intermediate between the older French faces and Bodoni. 
The italic is almost identical in form with the roman, and its inclination is slight. Тһе wide H and U 
and the pointed j were freely criticized. 

Walbaum (374). This is a German ‘modern’ type (1810), revived by the Curwen Press and 
later by Monotype. The design is based on Didot ; and both roman and italic are wide and therefore 
readily legible. The roman, with its non-descending J, spurred b, e with a large counter, and square- 
topped t, suggests a lighter version of Scotch Roman, though the pointed j indicates its French 
origin ; the italic гапа w, though wider, are reminiscent of Bembo and the double curved v of Bodoni. 
phe descenders of the p and q have no serifs. Walbaum and Didot were among the least popular 

асеѕ,2 


2. British. English printing has always been characteristically light. Hence in Britain, even 
modern book faces have tended to avoid the excessive thickening of their continental prototypes, 
though in the 19th century Thorne and his imitators set a fashion for ° fattened ’ styles that have 
remained popular with commercial firms. For the most part anything glaringly foreign was dis- 
carded, and several distinctively English features due to Caslon and Baskerville were preserved. 
At the same time the British versions introduce one or two points that are wholly new. 

Bell. What is sometimes called the earliest English modern face was cut in 1788 by Austin for 
the London bookseller and publisher, John Bell, who was the first to discard the long-tailed s. In 
Contrast to the condensed modern faces of Continental printers, its well-rounded letters give it a 
distinctly British look. It was recut by Monotype from the original punches found by Mr. Stanley 

orison in a lumber-room of Stephenson Blake (the Caslon Letter Foundry) Like other modern 
faces, it has horizontal serifs, perpendicular shading, and a large counter to the romane. It hasa 


sfraia 5 ital ser ain italic J standing on the line (though the roman drops below). 
ight serif to the italic j, and a plain italic J n he curved upper arms adopted by Caslon 


n the other hand, it retains the peculiar italic Y, with t 1 
and Baskerville. d reverts to the excessively sloped V and W^ of Caslon. The lower limb of the 
italic К, K and R are curved, like those of Baskerville; and similar curves now appear in the roman 
k, К, апа R. The italic x is also curved—formed by two cs placed back to back. These curvilinear 
pocyations were widely criticized by our readers ; but with the scientific group it was one of the most 
opular of tho older founts.3 x 
Scotch pos (46). In matters of literature and the arts Scotland has always kept in close con- 
tact with France ; and there has been, and still is, some degree of trade rivalry between the printers 
of Edinburgh and Glasgow and the presses of Oxford, Cambridge, and London. It is not АШЫН ШЕ, 
therefore, to find that the changes distinctive of the new French printing at the close о Де 18th 
Century were quickly reflected in Scottish publications. In 1815, Alexander ison, a ‘ AEN 
Printer, introduced what he called ‘Scotch modern face —a. DE version О UE. 
he ascenders and descenders are shorter, and the serifs have slight brac È 5. уре E 
anne are the non-descending J, the Q with the tail beginning See omor aay tgs Bodoni 
the с i ending with an upward turn lik ü . 
the top of the oe oe еке "The italic is not unlike that of Baskerville, bubless condensed 
and more emphatically shaded. Тһе p, however, begins with a hook like i, j, m and л; and the x 
resembles that of Beli The y is new, with a round bowl like that of the ordinary SUA hand. 
Indeed, many of our readers complained that the Ts rd TE UE Ad кыы] 
© cheap printed visiting card, “ with * thin upstrokes an thick dowi kes’, as жер 
Schoolmaster used to say". This became one of the most popular бш ates sein У. — 


. Modern Series . * type face (Series 1) was « [ 
Series 7 (an саш ЖЫДЫ indfudes over 600 * mathematical sorts, and is the fount adopted 


by the н й cfc iversity Press, 1954). It is thus a specially 
h author: 7 "inti Aathematics (Oxford University Press, © н у 
Important pr E dulcia eem In its most obvious characteristics It resembles Scotch Roman, 


M A Я : in verticals have brackets: thus U has a 
ОЗ ОГ the serifs, however, are unbracketed ; but {he M serif at the top of the right handan 


hair-]i А Y 
i ne at the t hand upright and a triangu de 1 
aonsistency оаа noted. Unis Seo T bowi, like ЧО g eg а P iod 
“> In most ot У italic, has a plain ova , ааб ше 

e w hasa te instead of a loop ; the v and w start with a simple hook (like 


т, n, and u Н i « с of older British founts. FEIN 
] r ex curve of o à 
i mong our suene Sean Monet 7 appeared the most popular of all. , The non-scientific 


Sriticizeq z Sn d times on aesthetic: many objected that it had 
5 c grounds, someti ! ic ted tha 
an Ui ERES Ies Qr pei leeks Others thought it " too rigid and mechanical”; several 


by the Didot family at 
ped ufterwards Louis XVII 


the very beginning of the 19th century : 


1 Py 
In France the gout nouveau Was develo; I) and the Comte d'Artois (afterwards 


the mo, c 
t celebra i i Monsieur X > м ( 

harlestcelebrated were printers to Monsieu Lonis ХУШ) Ane reformed, the point system, 

making ee edt ie Ee Me e metre Pob. "Their design, o cate uem or e 

int т 1 printi i ast 1 ears, 

in Tied style for French printing, where, during the last y 

з Brain (Duckworth, 1953). А 

n and Unwin, 1955) is printed in 11 point Bell. 
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who laid claim to typographical taste believed “ it was time that scientific publications attempted a 
style of printing less crude ” (or *' less inartistic” or “less old fashioned ”, i.e., 19th century).! 

Italic. In general italic type differs so widely from roman that we considered it essential to 
obtain preferential orders for the two founts separately. 

What we now call italic was first introduced by the Venetian printer Aldus Manutius in 1501 
for his pocket editions of the classics. For a more popular kind of reader it was natural to turn to 
a more popular kind of letter—the everyday cursive which had grown up side by side with the formal. 

Moreover, a running hand tends inevitably to slope ; and the slope allows greater condensation and 
SO economizes space : (with Baskerville the italic takes only 87 per cent. of the space required by the 
roman). Griffo, who cut the type, is said to have taken Petrarch’s writing for his model.2 These 
early attempts were somewhat crude and found few imitators: in particular much difficulty 
was experienced in sloping the roman capitals so as to give them the appropriate inclination. The 
standard italic is derived not from Aldo but from Vicentino (Ludovico degli Arrighi of Vicenza), 
a writing master in the Vatican Chancery, who (like Baskerville later on) turned printer and published 
(1522) а copybook claiming to teach /ittera cursiva o cancellerescha : this was a nco-Carolingian 
minuscule, used for papal * briefs ’ (i.e., short notes and informal correspondence) ; and nearly all 
the * swash ° capitals that appear in later faces are to be found among Arrighi's decorative majuscules. 
Robert Estienne,* one of the most celebrated scholars and printers in the reign of Henri II, adopted 
Arrighi's letters as his model ; and most later italic founts are descended from that. ° Italic ’ remained 
for long an independent rival to the * roman ’ letter. Being less bound by convention, it has varied 
far more widely. Hence it is always easier to recognize a type-cutter from his italic than from his 
roman. 

In England ‘ the sweet Roman hand ' (Twelfih Night, 1, iv, 31) only became popular with the 
cultured classes about the time of Elizabeth I. Half a century later, as England gradually acquired 
the carrying trade of the world, English commercial clerks developed a more rapid and rounded 
script, which became known abroad as anglaise coulée, The letters of Snell's commercial copybook 
of 1711 differ but little from those of Vere Foster on which the oldest of the present authors was brought 
up. The English type-cutters of the 18th century naturally modified the Franco-Dutch italic to 
conform with the national style. Baskerville, who was himself a writing master, and the later printers 
of * modern ' types, tended more and more to follow the flowing script of the copperplate engravers. 
During the 19th century, when italic ceased to be an independent type-face, one would have expected 
it to harmonize more closely with its roman context. Early attempts in this direction, as we have seen. 
were made by Fournier and others at the close of the 18th century ; but in this country the principle 
has still secured only a half-hearted acceptance. 

The motives which seem thus to have determined all these different styles are not unlike the 
reasons given by our readers to explain their own preferences. Those who prefer a scholarly an 
dignified hand admire the humanistic pattern of the older Italian type-cutters ; those whose own 
handwriting tends to flourishes enjoy Baskerville with its swash capitals ; those who look for 2 
hand that is swift, practical and free from stylized mannerisms prefer the smooth and uniform 
roundness of Modern 7. Many of our teachers in particular stressed the fact that children often try 
to “ write like a book ", and produced several instances in which pupils suddenly altered their lettering 
on the introduction of a book with a novel type face. (A familiar instance is the occurrence of the 
Girsel e, d, and п m the hand ate of classical scholars.) 

mong readers who took part in our experiments the sharpest c inion a between 
those who like their italic to contrast with the roman and ine on аг it should conform an 
harmonize, But here again much obviously depends on the reader's implicit notion of the functions 
of such a type—a point too frequently ignored. Is it to be used for prefaces, for passages quote 
from ancient authors, for initial abstracts or summaries, for mathematical theorems, to provide an 
algebraic notation, to emphasize a conclusion or key phrase in the text, or merely for side-headings， 
stage-directions, and the like? For emphasis or headings many thought it logical to use. nO 
a cursive and therefore a lighter ў et JE mote гове her 
e an a lighter type, but heavy semi-bold roman. But all this raises yet anot 
ei e any problems awaiting psychological research. The layout of commercial advertisements 
w cen rendered far more successful by the use of trial inquiries planned by a statistic? 


s 
Ko included "беп one of the modern type-faces that nr 
, popularit d 4 was rat ite hi all gr e 
Exremely short deseenders and e ang matase in а certain newspaper, but ow oy literary readers, Tor 
newspaper work ; but ere criticised by manters (notably in the е) no doubt are particularly suitable 
comparison with Pet, у dis 
1l, 1, p. 128. yop rarch's hologr pui 
3 See A. F. Johnson and S. i x 
pp. 23:52. ds S. Morison, * The Chancery Types of Italy and France ^, Fleuron, Ul, 1 
4 i i 
Fives pou. MAE RUSSE. for his Greek New Testament which remained the * Received Text ' for € 
extus Stephanicus, A D 1550". пу By es studied the Greek Testament in a reprint ар 
or Centaur, modern designs have usually reverted to the originals Ly apex ed аы 
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Psychologist. Might not publishers benefit by similar preliminary investigations, instead of trusting 
to the conventional or impressionistic judgment of their production-manager or printer? 

Mathematical Publications. The needs of a mathematical publication are somewhat specialized. 
The italic letters used as algebraic symbols seem to have become pretty well fixed during the 19th 
century when textbooks were for the most part printed in a * modern’ face. Hence many of our 
readers thought the g, v, w, x, and y as used in this journal “ looked like an infringement of the cus- 
lomary symbols". The symbols suggested in The Printing of Mathematics are certainly superior ; 
but even they are not wholly satisfying to the statistical psychologist. For example, the * bold face 
Series ’ there proposed (which the statistician would often need in matrix work) do not belong to the 
Same type face nor do they correspond with the ordinary letters, as may be seen by comparing the 

5, Q's, v's, апа w’s. Moreover, a statistical journal has additional problems of its own. Modern 
Statistics requires Greek letters for the * population ' corresponding in slope and x-height to the roman 
letters used for the * sample’, and a variety of symbols far wider than that of the available alphabets. 
For these and other reasons Professor Kendall holds that there can be “ no general standardization 
of symbolism, at any rate for some time to come". On the other hand, he agrees that there might 

е "^ а case for standardizing symbols in a particular branch ” (26, p. 8). These, however, are ques- 
lions that we hope to take up at a later stage in our inquiries. 

A Note on the Psychology of History. In our paper (р. 44 above) we ventured to suggest that 
the history of typography might provide instructive illustrations of the broader psychological prin- 
Ciples that underlie the course of history. Each of the great philosophers of the past has produced 
his particular * philosophy of history ' : each has seen in it some kind of pattern, usually conforming 
With his special brand of metaphysics. The professional historian has, needless to say, usually 
Scouted such generalized interpretations. Elsewhere, however, one of the present writers has argued 
that what is required is not a philosophy but a psychology of history, based on a direct analysis of 
empirical data. The historical study of typography would, we believe, admirably exemplify, in 
simple, concrete, and readily accessible form, most of the general principles that have been invoked 
to explain the larger course of events: the ‘swing of the pendulum’, the ‘ threefold dialectical 
Movement’ beloved of Hegel and Marx, the * evolutionary pattern ^, the * patternless pattern * (“ just 
One dam’ thing after another”, as Professor H. A. L. Fisher maintained), and finally the economic, 
the technological, and the cultural patterns. The study of such trends is primarily a problem for 

e psychologist, a problem which, perhaps because of its complex nature, the psychologist himself 
1a8 hitherto left severely alone. Nevertheless, it is our firm conviction that, by starting with the 

1510гу of relatively simple forms of human activity, such as the various arts and crafts, to which 
typography forms an excellent introduction, an illuminating series of researches might well be under- 
taken, which would yield valuable data and instructive conclusions both for the historian and for 


the social psychologist. 
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. Note on Table of Type Faces. For the greater part of the nineteenth century printers had prac. 
tically only one style of type at their disposal; today they can select a design from almost any perio 
in the history of printing. The table opposite shows the most important of these faces, ап! 
intended to illustrate the specimens chosen for our experiments and the criticisms made in the 
introspective comments.! To aid the reader in distinguishing the characteristics of the classes 9 
type face indicated by the factorial analysis, we have included only those letters that vary Wide 
from one fount to another; and, instead of following the usual alphabetical order, we have place 
first those letters which have variants common to the largest number of families or groups. The 
nominal size is the same throughout, viz., 12 pt. In the text, where letters from different founts have 
been described or discussed, the exigencies of machine composition have forced us to set them in t 
type-face used for the rest of the article, namely, Times New Roman, лог in the particular foun 
referred to. P 

1 We regret tl i " = s — issions 

ne Rf er, а 


пег 
puts it, “ they seem to aid the flow of the eye” (a phrase which is i itici п our exper 
ments on numerals we certainly found Gill Sans better for short OP EU. za UM timetable 
where they also facilitate economy of space); but they were less satisfactory for long numbers (as in mathematical tables g 
presumably because they do not combine so readily into distinctive groups, But we hope in the near future to publsers 
separate investigation on type for numerical publications, such as ca ildren’s arithmetic books, tables in statistical рар 
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SPECIMENS OF TYPE FACES 


Bembo 


Centaur 


Veronese 


2 Garamond 


Fournier 
Plantin 


Ehrhardt 


Caslon 

Baskerville 

Old Style 

Imprint 

Times New Roman 


Perpetua 


Bodoni 


Walbaum 


Bell 
Scotch 


Modern 7 
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INTERNATIONAL CONFERENCE ON FACTOR ANALYSIS 


The Centre National de la Recherche Scientifique, aided by a generous grant from the Rockefeller 
Foundation, is organizing a Colloque International on * Factor Analysis and its Applications 7, to be 
held in Paris from 11th to 16th July, 1955. The provisional programme is as follows: 


11TH JULY. Inaugural addresses and report on the Uppsala symposium. 

Professor L., L. Thurstone, * Recent studies in factor analysis °. 

Professor Sir Cyril Burt, * Factor analysis: historical survey of methods and results °. 

Professor H. Piéron, * Le probléme général de la recherche et de la nature des facteurs en psycho- 
physiologie ’. 


12TH JuLy. Professor Н. Hotelling, * Relations of the newer multivariate statistical methods to 
factor analysis ". 

Professor H. Pineau, * Remarques sur l'analyse factorielle de Hotelling et comparaison avec le$ 
méthodes centroides ’. 

Professor M. Yela, * Résultats et signification psychologique de l'analyse factorielle °. 

Mme. G. Bernyer, * Les facteurs psychologiques: leur nombre, leur identification, leur nature ’. 

Professor M. Reuchlin, ‘ Facteurs obliques, facteurs orthogonaux, facteurs de second ordre, en psycho- 
logie’. 


13TH JULY. Dr. L. Guttman, ‘ The Radex approach to factor analysis ". 

M. J. M. Faverge, * Analyse factorielle et combinaisons linéaires de variables ". 

Dr. H.J. Eysenck, * The validity of factor analysis °. 

M. E. Schreider, * Application de l'analyse factorielle à l'étude de la variabilité biologique ’. 
Dr. S. Ledermann, * Application de l'analyse factorielle à l'étude de la mortalité ". 


15TH Tuus _ Professor G. Darmois, * Quelques résultats théoriques et leurs conséquences pour lgs 
applications °. 

PE В, „Delaporte, “Nouvelle méthode de statistique math ématique pour l'estimation des 
acteurs’. 

Dr. A. H. El Koussy, * Trends of research in space abilities ". 

Professor T. Husen, * Factor analysis of achievement tests °. 

Professor E. A. Peel, * The factor analysis of correlations between persons with independent deter- 
miners to identify factors'. 


16TH JULY. Sir Cyril Burt, Summary of Conference. Final Discussion. 
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BOOK REVIEW 


Faster than Thought: A Symposium on Digital Computing Machines. Edited by B. V. BOWDEN 
London: Pitman & Sons. Pp. xx 十 416. 


MucH of the calculation required in statistical psychology will in the near future be performed by 
electronic computers. Hitherto, psychologists who have devised working techniques for factor 
analysis and similar procedures have striven to keep them as simple as possible: usually they have had 
In mind a student or research worker equipped with only a hand machine or with nothing at all. 

his Preference for easy methods—for graphical rotation instead of arithmetical rotation, simple 
Summation instead of weighted summation, guessed communalities instead of correcting for specificity 
Ог applying maximum likelihood—has frequently provoked the criticisms of the professional 
mathematician, and is now gravely impeding the development of rigorous and efficient formulae. 

he time has therefore arrived for the psychologist to adapt his working methods to the improved 


© drudgery a esses the white collar worker. 
gery and boredom that now oppress: е The first dealszwith һе 


he book, which i i alls i hree main parts. 
, s pr ely illustrated, falls into three main p: t 
hich i protiy he second with the commoner types of machine 


re important fields of application. 
presents a brief historical survey 


Of calculati i i heir accounts on clay tablets. To the 

ing devices from the days when Sumerians stamped their account К 

Psychologist the spasmodic way ia hich the whole project has developed is itself a problem of singular 

ant The first effective adding machine was built by Blaise Pascal in 1642 at the age of 19, to 
is fath as : of the French customs: 1 

iM ie пөш Leibniz exhibited before the Royal Society of London an 

[Ме add but multiply. Nevertheless, two hundred years later 


0 i д 
iw Quite obvious that he had successfully hamn 


igital computers à 
are built. а 

д c man of means. Though he had left 

с arles Babbage was the son of а banker, апа {Шү af the Royal Society at 22, and at 36 


Иде without sitting for the Tripos, he became he Chair once held by Newton. This 


Was elected t7 hip of Mathematics, tl 
Office he to the Lucasian Professors YP n'siyi a single lecture. Babbage was a highly original 
inve retained from 1829 to 1839 without ETES er THE first speedometer, ‘ skeleton keys for 


上 
entor. He is sai ised the cow-ca Д 
Unpickable flocks A etd sodas for solving codes and ciphers, а valuable method of detecting 


! Quite re " ‘ D ronic office which works out the weekly wage-packets 
gheir. 7.000 pent xpo ctore pus А е in the various teashops, and is rented in its spare 
сопс 19 Outside bodies. ^ Io cael analys s day by d automatic control, a vice-president of the ford Motor Company has 
191164 the word * ies. To cover Bioline es ‘Conference on Automation and Industrial E е lopment, New York, Мау, 
Indd* n this automation ’ (cf. R. К. on Human Relations їп Industry, set up by the Department of Scientific and 
ustrial Research ш the Joint Commit eh Council has sponsored a small pilot research at Cambridge on automatic 
Subject in industry: mu мешен Rs а ROWE through rhe pres РЕР. has issued an instructive pamphlet on the 
ad We Lovards an Automatic Factory Planning, ОП, Survey of the Mediterranean, fell in with a Spanish vessel, 
Smy, during th that Captain Smyth, R.N., d the navigator with a handsomely bound copy of the А/тайас. Captain 
th wha ® exchange of courtesies, presente th quite safely: the Spaniard was never heard of again. 
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climatic cycles based on rings of tree-trunks, and finally the occulting lighthouse. His analysis 
of post office economics led to Rowland Hill’s introduction of the penny post; and he even anticipated 
the experiments of Professor J. B. S. Haldane by spending ten minutes in an oven at a temperature of 
265° Fahrenheit. А . 3 

To cut the cogs for his computing machines with sufficient regularity, great improvements had 
to be made in lathes and precision-tools; and in workshop methods alone his achievements, as Dr. 
Bowden remarks, must have more than justified the grant which the Government gave him towards 
constructing his earlier models. To feed in the figures for his computing mechanisms to work with, 
he hit upon the idea of using the punched cards that had just been invented by a French weaver, 
Joseph Jacquard, for controlling his looms.! Unhappily, and not always through faults of his own. 
hardly any of the machines that Babbage designed and started were actually completed; dexterous 
as he was in handling tools and formulae, he had no fiair for explaining his ideas to others and was 
wholly devoid of the tact and good temper needed to cope with government officials and scientific 
critics. He died, a disappointed visionary, in 1872; and his workshops were found littered with what 
a contemporary described as “ the wreckage of a brilliant career, and bits of machine which will 
doubtless remain for ever a mere theoretic possibility ". : E 

In his own eyes his greatest feat was the invention ofa new algebra to describe the logical relations 
between the various parts of his machinery. The symbolism proposed was highly ingenious. But 
Babbage himself was too engrossed in applying his principles to publish any clear or comprehensive 
exposition of them. The best account both of his methods and of his machines we owe to another 
extraordinary personality, Ada Augusta, Countess of Lovelace. н — 

Early in 1815, Lord Byron married "a young heiress and savante " named Miss Milban a 
In his love letters he calls her his * Princess of Parallelograms ". In Don Juan 2 she figures as " the 
winsome Donna Inez, Don José’s learned wife ". ** Her favourite science was the mathematical. 
A year later “ Don José and his lady quarrelled ", and ** were for ever parted”. Meanwhile a chil 
had been born— 

Ada, sole daughter of my house and heart, 
A child of bitterness, though born in love.3 


“ My voice ", he says elsewhere, “ shall with thy future visions blend, To aid thy mind's develop- 
ment." And indeed her mind apparently inherited both her father's literary talents and her mother $ 
mathematical abilities. As a girl she studied under Augustus de Morgan, Professor of Mather 
at University College, London. One day his wife took her to see * Mr. Babbage’s engine ’. Ww ШЕ 
the rest of the party " (says Mrs. De Morgan) “ gazed at the instrument with the expression zavan 
are said to wear on first seeing a clock or a gun, Miss Byron, young as she was, saw the great beauty 
of the invention and understood its working."4^ A few years later she married William King, afte 3 
wards first Earl of Lovelace. She and Babbage remained close friends; and of all his contemporaries 
she alone seems to have really grasped what he was trying to do. " 

In 1840, Babbage gave a series of lectures at Turin. They were attended by L. F. Menabree 
(an officer of the Italian Military Engineers and in later years one of Garibaldi's generals) who yro 
out a detailed * Sketch of the Analytical Engine invented by C. Babbage’. Lady Lovelace translate? 
his paper into English, supplementing the description with valuable mathematical notes and imi 
hand explanations, and drawing up an intricate chart or * programme ’ for computing the Bernon a 
numbers by way of illustration. Later she and Babbage worked out an elaborate mathematice 
system of betting on horses, and endeavoured by its means to raise funds to build a still better machin : 
Both lost heavily. Shortly afterwards, at the early age of 37, she died, and was buried beside her 
father at Newstead Abbey. Dr. Bowden has had the happy idea of reprinting the whole of P 
memoir in an Appendix of sixty pages.5 


E cau of 
1 A few years after Babbage's death the same idea occurred to Dr. Hollerith, then in charge of the American Burca үр 
the Census. Although by law а U.S. census had to be taken every ten years, the work had grown so rapidly tha rter 
existing methods it would have taken twenty years to complete the requisite calculations. Hence the ' Hollerith 50 
with which every psychologist is nowadays familiar. inlication 
2 The story is told in the first thirty cantos, as an awful warning to any who would “ wed a walking multipli: 
table ”, with the final moral: 
Oh, lords of ladies intellectual, 
Inform us truly, have they not henpeck'd you all? 


^, "T" > Н 816 
The footnotes to Murray's edition of 1837 explain the rather cryptic allusions. Ses also * Fare Thee Well, March, 181 
the manuscript of which, says Murray, “ is blotted all over with the marks of tears lations 
3 Childe Harold's Pilgrimage, MI, i, cxviii. It is curious that no editor, commenting on the poet's repeated specu 


about the “ destiny of his only child”, ever tells the reader what her destiny actually was. ds 
‘See this Journal, IV, 1951, рр. 198 f. As Dr. Bowden remarks, “ the Attitude of the average intelligent man towar 
mathematical devices has changed very little during the last hundred years”. he 
> Babbage himself considered Lady Lovelace's account of his machine to be the best available, and noted that she 
corrected an error in his own recurrent formula for computing the Bernouilli numbers. 
A passion for mathematics, horse-racing, and foreign adventure appears to have run through her family for Wot, 
three generations. Her daughter, a brilliant mathematician and Arabic scholar, married Wilfred Scawen Blunt, the Pith, 
and sought to introduce Arab blood into the race horse (see The Authentic Arabian Horse, 1942, by Baroness Went 50 
her granddaughter) „When in а broadcast talk I mentioned Lady Lovelace and her mother and daughter as members fh, 
remarkable mathematical family overlooked by Galton, I received a Jetter from another of her descendants telling "puting 


IA living members were equally gifted, and that one happened to be working for a well known manufacturer of co™ 
machines. 1 
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ws The subsequent history of computing machines from the days of Babbage to those of the earliest 
голе computers is dismissed rather briefly i „and the remainder of Part I consists in a compre- 
Ü nsive and lucid account of the internal organization of the modern high speed instrument and of the 
ireutt components and other mechanisms on which its working depends. 
có Part II is devoted to the description of existing machines. Nearly a dozen different specialists 
шише a chapter each on the latest types now working in this country and in the United States. 
tine British psychologist in particular will be grateful for the full details about the construction and 
capabilities of the machines available at Birkbeck College, Imperial College, and the National 
hysical Laboratory. 
1 „The third and longest part of the book is concerned with the applications of such machines. 
t discusses at some length the services which they have rendered, or might render, in government 
Offices, in business and commerce, in the fighting services, in crystallography, meteorology, dynamical 
astronomy, and in various branches of scientific engineering. To the psychologist the chapter of 
Breatest interest will be that dealing with the solution of logical problems. 
d Once again the history of the basic idea goes back to Leibniz; in a well-known passage, he de- 
ared that, with the methods that he advocated, “ wrangling would be as unnecessary between philo- 
Sophers as it is between accountants: they would simply say: ° Let us calculate ' ". То take his 
own example, suppose * man’ to be represented by the number 6; then the prime factors of this 
3 umber may represent ‘ rational’ and * animal’: thus the proposition * man is a rational animal" 
Will be represented by analysing 6 into its factors, 2 and 3. And, with more complex subjects, an 
ars combinatoria will enable us to construct and compare all the relevant combinations of such factors. 
n this way logic will become a kind of algebra. н 
M What Liebniz only dreamt of, George Boole actually carried out. In 1848, Boole (professor of 
Mathematics at Queen's College, Cork) published his Mathematical Analvsis of. Logic, which was 
DUST expanded into his more celebrated Laws of Thought—a book that Bertrand Russell once 
ailed as the most momentous contribution to formal logic ever written. Its object was ”to express 


g е operations of reasoning in the language ofacalculus". From this it was a comparatively simple 
tep to invent an apparatus that would apply the calculus. The first machine to beactually constructed 
ther of De Morgan's students at University 


c the * logical piano ’ designed by Stanley Jevons (ano! 0 at L à 
ollege and later Professor of Political Economy there): a woodcut of it forms the frontispiece to his 
Ook on The Principles of Science. The instrument, as he tells us, was inspired by the work of 
ee Morgan and Boole. Its operation assumes the logic of dichotomous classification (X and not-X, 
С.); and the essential processes have an instructive analogy with the factorial analysis and factorial 


Synthesis of qualitative data. eS А 
th The statistical psychologist often finds it difficult to persuade his critics that the algebraic methods 
КЕ he adopts are merely an application of rigorous logical principles: perhaps a more effective line 
Of explanation would be achieved if he argued in the opposite direction, and tried to show, as Jevons 
es, that complex logical problems, starting with purely qualitative data, can be efficiently solved 
Y the application of mathematical formulae. Let us therefore begin by comparing the procedure 
Proposed by Jevons with that adopted by the statistical psychologist.) This will help us to a clearer 
o derstanding of the principles embodied in the more modern logical machines described by Dr. 
еп. 


The logical reasoning which Jevons' machine is capable of performing is based on the principle 
erms of (1) conjunction and (2) negation: thus, 


that all purely logical relations may be expressed in t 
, Either A or B ^ may be expressed by saying * The conjunction of Not-A and Not-B does not occur °; 
ГА, then B^ by saying * the conjunction A and Not-B does not occur °; and so on (Jevons, loc, cit., 
m 73 f). And from this it would evidently follow that every statement, so far as it involves logical 
lations, can be expanded in a logical sum of logical products (the so-called * Boolean expansion °). 
Y 1 akin to matrix multiplication, which consists 


ch: an expansi 
Я pansion сап be computed by methods 
Essentially in forming product-sums. However, instead of the somewhat abstruse examples borrowed 


in 41 For this ji i i efer to David Baxandall's excellent article on * Calculating Machines " 
in the 局 his intervening period the reader may refer to David | s PERE йкел oth By гй 

Eney s 8 ully exploited the principles set forth by Babbage 
"icyclopaedia. Britannica (14th edn., 1929). A machine which ly explottes ihe sequence-controlled calculator, 


Was not act 
k асай, il à «d years after he had formulated them, п ni м ato 
known Rs SU 2 ii hundred уе Professor Aiken of Harvard in 1939 and put into service a few years later; its 
eut ark 1, clesignee 95 ot driven through electro-magnetic clutches which in turn are controlled 
1 ic Numerical Integrator and Calcu- 


by eating elements consist of mechanical counte: 
lator (ifo mechanical relay circui t all-clectronic computer was the Electronic ， pi 
Tor ENIAC) buna i GMT of En first allele ay of Pennsylvania, 1946: it contained 18,000 valves and 
been Seana ge stil ee iue the Harvard Mark I. Still more ingenious models have since 


relay: H ines ч у 
5, апа works several hundred times faster O Kerg at Philadelphia, and affectionately christened EDVAC, 


een deis 
EDSACSigned by Professor von Neumann and 

on inelegans Specimen Demonstrandi in Abstractis " 

ted by Jevons in the Preface to the 


"n TES OHNIAC, and MANIAC. is 
Z's proposa ined i t 5 Е 
Secon on, Leibniti Opera, AD E n. uctive passages are quo! 
tion of his Principles of Science (1877). a Р 1 i Жопа whi 
The «fron of his Principles of Science (1877) «o improv lative knowledge by mechanical operations ”, which 
fra forimproving speet eas presumably suggested to Swift by Leibniz's 


er rare tract, entitled * N 


The most instr 


the 


Profess tM or engine’ with forty iron handles ошар 

des Professor de : Pny of Lagado *, w e " 

*Monstrations monstrated to Gulliver at the grand aca ever, Supposed it to be based on Ramón Lull's Ars Magna, 
P RON Soci г GIL The Principles of Science, 1874, pp. 108 f. а 


+S. Jev i 97 f. ; 
seS ons, Phil. Trans., CLX, 1870, pp. 497 1; ; й -185. 
Cf. © The Factorial Analisis of Qualitative Data ’, this Journal, Ш, 1950, рр. 166-18. 
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by Jevons from physics or chemistry, let us start with a concrete problem sc simple that its solution is 
obvious almost at once.! 

Just before the war, a local education authority gave the following instructions to a recently 
appointed psychologist. *' Pupils who are innately deficient in general intelligence or are backward 
in the essential subjects of the elementary curriculum by three or more years are regarded as educa- 
tionally subnormal. All other pupils will be regarded as normal and will attend the ordinary 
elementary school. Pupils who are subnormal must be recommended for transference to the special 
school. The term ' pupil’ here includes any boy or girl between the ages of 5 and 14 who is physically 
fit for classroom instruction." The psychologist thereupon asked himself: “ how precisely do I 
classify the children referred to me, and which of these classes am I to recommend as fit for a special 
school or for an elementary school respectively ? ” 

Confronted with a problem such as this, we start, as Leibniz and Jevons would have us do, by 
expressing the relevant qualifications in terms of four components or * factors: (i) G, a positive or 
* general ’ factor, denoting the whole universe or genus of ° pupils ^; (ii) X, a first * bipolar ’, denoting 
suitability for a * special school ' or the reverse; (iii) A, another bipolar, specifying presence or absence 
of innate ‘deficiency’; (iv) B, a third bipolar, specifying presence or absence of * educational 
backwardness’. Thus, in theory, the entire genus will consist of 23 = 8 conceivable classes of 
persons which we can number from i to viii. " 

We now require symbols to indicate the positive or negative aspects of these components. With 
literal symbols, we can use (with Jevons) X and x, or (with Woodger and others) X and — X, or (with 


Welton) X and X. With numerical symbols we can enter in the row or column reserved for the com- 
ponent in question either (a) 十 1 (for the positive term) and — 1 (for the negative term) or again (b) 1 
(for presence) and 0 (for absence). It will be seen that these numerical alternatives correspond with 
the factorist’s choice between a bipolar factor and a group factor. In the electronic machine the 
alternatives may similarly be represented in two different ways. (а) They may be indicated by a pair 
of contrasting signals, the mere absence of a signal conveying no information: this is the principle 
adopted in the old Morse code machines and in the machine built by the Telecommunications 
Research Establishment at Malvern. Or (5) they can be represented by a relay or a switch which i5 
either * on’ or * off’ : this is the device now adopted by most electronic machines. 

In my previous paper I used the second type of notation (0 or 1). Here I shall attempt to show 
how the first can be used. Adopting this convention we can specify the 8 classes by a ‘ bipolar 
matrix, such as the following: 


Classes of Persons i ii 


v vi vii viii 
Factor G +1 +1 +1 +1 +1 +1 1 
» X +1 +1 1 一 1 一 一 一 ! 
ü A +1 +1 1 +1 + -i -i 
s B 十 ~ ME 


With the alternative notation we should obtain a matrix in terms, not of * bipolar factors °, but 


of * group factors *. But the bipolar matrix can evidently be rotated to the group factor form 
means of the following pre-multiplier: 


Bipolar Factors 


G x A B 

G 1 0 0 0 

Group Factors 2d 1 é 9 p 
B’ i 0 0 $ 


Out of the eight conceivable classes deducible from these three * bipolar factors’ the regulations 
regarding special schools exclude a certain number by imposing the requirement X = A v B (i.e. t 19 


X must be either A or В). The next step, therefore, i i d by the 
: : s , is to d а e covered D» 
requirement ‘ either A or B’ , is to determine what classes аг 


Ц T - The requisite operation can be carried out by first rotating the ЫЇРО = 
matrix as above described, and then applying the summation n 0 0 o] to obtain 


0100 
0 0 


€ 
ae w the usual rules, except 2 that (十 1 1) = +1: (the procedur 
is in some ways analogous to ‘unweighted summation in facic). Ba iy ^ Fhe premultiplie 


e th -Or- Н ie 
= JO 2 ONO Брат rows. ree one-or-zero rows back to bipolar form, thus У 


1 КҮҮН 5 сог 
" fools See Бето нар “атр taken in а non-exclusive sense: as Mill puts it, “ when we say X is ‘either a Knave е 
therefore seems unfortunate, since it тайну жле с iios, „Thus our ‘ group factors? may overlap. Dr. Bowden's exar 

2The apparent paradox has a ран sies Out the joint possibility, whereas his machine obviously does not. — all 
in classes i, ii, and iii are A ^; с] je qu logical explanation: +1 in the new row A and cols. fiz, and iii means | ja 
classes i, ii, and jiii areleither A or B ane Tow B means “all in subclass i are В”. The logical sum states denote 
Der on-exclusive sense). And in each case we must again use ‘+1’ t0 
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For the last step we note that the relation of equivalence is expressed by the operator [4, 4]. 
On using it to pre-multiply the two bipolar rows just obtained, we get a single row, whose values 
contain the solution to our problem, namely: 

_ Subclass i iii iv v vi vii уй 
Final result 1 0 0 0 0 -1 
These results indicate that, with the requirements imposed, only the first three classes, GX(AB), 
GX(AB), GX(AB), and the last, GX(AB), satisfy the regulations: the former, as we see, specify 
those persons that are fit for a special school (X), the latter, those that are not (X). The four remaining 
Classes (those marked zero) are forbidden by the regulations, either explicitly or implicitly. 

„ In practice most readers would doubtless solve an elementary problem like this almost intuitively, 
Without recourse to symbols and conscious inference.! But with the blind working of an artificial 
apparatus a step-by-step routine is unavoidable. To demonstrate the possibility of such an instru- 
ment, Ferranti, Ltd., have built an electronic contrivance capable of dealing with three components: 
at Harvard a larger machine will deal with as many as twelve.? 

With the method I have described, it will be seen, the last step of all involves scanning the final 
Tesults for each possible combination. Three components yield only 8 combinations; but 12 would 
yela as many as 4,096; and twenty over a million. Thus the number to be scanned rises swiftly 
t an amount that is quite impracticable even for the most up-to-date high-speed appliances. For- 
early; when the problem is of great complexity, it is often sufficient to obtain just one positive 
Olution, without discovering all. In that case an alternative principle may be exploited—that of 
ieed-back. Constructed in this fashion the machine can work 50,000 times faster; and its behaviour 
5 virtually that of what is known as a discrete Markoff process.3 Once again Ferranti, Ltd. have 
“signed à machine based on this principle, which is also capable of furnishing every solution, when 


the probi ra 
em is of a comparatively simple type. d " i 
OSt of the logical problems handled by such appliances are, like the example I have used, 
Problems to be solved by deduction: the data of the problem offer us the relevant classificatory 
qj ponents or ‘factors ’ ready to hand. But in the natural sciences such classifications have to be 


Scovered first of all. I i imi inductive stage constitutes one of the special 
. In psychology, this preliminary inducti gi t > 
ks Of factorial EER E of factor analysis as distinct from factor synthesis. Itis to be hoped, 
60101, that in the near future both designers and programmers will devote attention to these more 
Ing demands.4 

he machines i have hit the popular imagination are those that play games. 

As lon chines that seem chiefly to have hi popular i LS prem 
Ж ing automaton which was able to ‹ efeat 
Б ago as 1769, Baron Kempelen exhibited a chess: playing б текш 


the Е ibed.i i Alan Poe; 
mperor Napoleon. It was described in some detail by Piera esiet Boa i concealed 


the stor ra 
ins; Y, the hoax wa: asked when somebody shouted ° Fire’, А E 

Inside str rs nn i aratus which could play * noughts and 
Crosses’ O AOE  Babnege he ranching called ‘ Nimrod ’ at the game of 


nd many r s may have tried to й 1 rod 

descr; p hen it was У eral at the Science Museum during the Festival on Pie ре озова 

well es computers that will play both chess and draughts, the former ra д 

Ds; mh engineering, physiologists, русаш, 
успојові г i i i behaviourists and philosophers 
h Ogists, and „philosophers (chiefly psychologists who ше. b Saee we are now 
Olly justi i i i ighly developed physical machine. 

H Y justified in regarding the human mind as no more than a Rig Ges ОЛАН 


nceforth we i à 
i nse wit 3 dubbed. 
со Snine’, a я be made for differences in complexity : ware е most elaborate 
Cells шег Contains no more than 5,000 valves, the Оңа + brain шош ү аиа AY ОЧЫ 
ib OOut six ti i the globe). at being SO," ed, 
Attribute the obvious aie odes the (NU to the inequality in initial equipment. In short, 
Ind ang TOUS. disparities SE a conclusion which raises the per- 
exi: е 
TE Questions discussed by Dr. Bowden in t 


€ begi i rmances of the most 
most “gins by comparing the epe хта поа Г instructive sur 


е 
flicient human computers, and 


! Th i 
ud е read Р ively simple 
bonda of Componentan iae Готедоіп eder P bers of the public could ЕН and all the bond-holders аге 
iret Dares, or both, i all bond 
ION атоо. реет (узат 
Cf. ument is capable of inductive as We. 
Feb, 477 W. Mays and D; C. Prinz, “A Relay Machine 


Mise: 
9 Usinnis Journal, IV, 1951, p. 194. 


i ve factorized a 7 x 7 correlation matrix 
in sin, r. Neuhaus have fac! 
Jo Pout 16 е method of principal components, 


Dr. Wrigley and Dr: Matrix with the Ordvac Computer’, this 
izati the Burt-Pearson Ma ^ ~ reel hey 
ojimple sw 1952, pp. 105-108). But no Ко a tempted solution t 人 
25,1952, pp. 105-108). y empto vided fa j llus x [ 
he rd ultiple repression (sic) analysis’, but does not enter into detailed 
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and methods of Professor A. C. Aitken, F.R.S., and Mr. William Klein, of the Mathematisch Centrum, 
Amsterdam. On the whole, it would seem, the machine has the advantage as regards complexity 
of problem, memory for figures, and speed of solution. These happen to be the features in which 
* calculating prodigies’ themselves outdistance the ordinary man. On the other hand, the human 
expert possesses at least two advantages over the machine: first, he can choose or adapt his method 
according to the accidental peculiarities of each problem, and thus improvise short cuts in a way that 
is quite beyond the scope of any machine; secondly, he is endowed with a power of * schematic 
apprehension °’, which no one has yet succeeded in imparting to an instrument: whereas the Domen 
calculator can grasp and manipulate whole patterns or Gestalten, the machine is compelled to won 
explicitly step by step.! Prima facie this suggests a difference of quality rather than of mere complexity 
or degree. - 

To examine the pros and cons of this ancient controversy would take us too far afield.? But 
itis worth while hearing the verdict of the engineers. As Dr. Bowden's quotations show, most of 
them are inclined to deprecate “ the attempt to infer complete similarity from partial similarity, 
however striking ". He himself points out that, although the engineers can produce a mechanized 
Barlow (who computed tables of squares and powers), and a mechanized Briggs (who compile 
tables of logarithms), they have not even hinted at the possible production of a mechanized Napier 
(who invented logarithms). And at least one impartial critic, speaking as a philosopher as well as 
a psychologist, emphatically agrees. ‘* There was once ", says Professor Mace, "a monk who 
invented a machine which could prove the existence of God. The monk, however, was cleverer 
than the machine: for, up to the present, no machine has invented a monk who could prove qs 
thing at all."3 Perhaps we may allow Lady Lovelace to have the last word (the italics are ы 
own). “ The analytical engine has no pretensions to originate anything; it can merely do what W 


order it to perform.” 
CYRIL BURT. 


1 Professor Aitken, in the course of a broadcast in which Dr. Bowden and I were allowed to question him, remarked 
that the process seems to imply ** a compound faculty, which, so far as I am aware, has never been exactly descri pole 
He suggests that “ the analogy of the musician might help: a violinist concentrates, when he is playing, on the tune as à W 
not on the separate notes or the fingering—except when he gets momentarily into a tangle ”. cV. 1955, 

e discussed the question in greater detail at the close of a recent paper in Brit. J. Educ. Psychol., XXV, 19°° 
pp. 18 f. 3 

3 Preface to W, Sluckin's Minds and Machines (Penguin Books, 1954), р. 9. The reference is, I take it, to the 
recorded logical machine, the Ars Magna (c. 1300) of Ramón Lull (a Spanish missionary and mystic who hoped t 
proofs would convert the Moslems, but he was stoned to death by them at the age of 80). Francis Bacon, however, 
nounced that it was a methodus imposturae. 
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THE DETERMINACY OF FACTOR SCORE 
MATRICES WITH IMPLICATIONS FOR 
FIVE OTHER BASIC PROBLEMS OF 
COMMON-FACTOR THEORY! 


By LOUIS GUTTMAN 
Israel Institute of Applied Social Research 


I. Introduction. II. The Problem of Determinacy. Ш. Supplementary Problems 
I. INTRODUCTION 


ssi l. Problem. Common-factor analysis—in the sense of Spearman, Thurstone and others, 
ACIES by considering the scores of a given population of N individuals on each of л quanti- 
ive variables from a given universe of content. Its goal is to express the observed scores 
as linear combinations of scores on common- and unique-factors. 
If It is typical of factor analysis procedures that they approach their goal only indirectly. 
g is the observed score matrix, a direct analysis would consist in finding factor score 
Matrices y and E such that 
é= Ante a) 
Where A is some real matrix of common-factor loadings. Instead of a direct resolution into 
actor scores such as (1), indirect procedures are used, which pivot on the observed correlation 
auch d ен К. Real matrices A, L, and U are sought, where L is Gramian and U is diagonal, 
al 
R= ALA’ + U:. (2) 
alw As is well known, unless further restrictions are imposed, infinitely many solutions 
kd ice exist for the right member of (2). There are different schools of thought as to the 
рос ns to be imposed on U, especially on the rank of R—U'*; this is the well-known 
Co lem of * communalities'. There are also different schools of thought as to what 
nditions should be imposed on А and L; this is the problem of * rotation of axes ’. Irres- 
poctive of their differences, each school in practice seems to regard its main job as completed 
criter en £ when A, L, and U have been computed for (2) to satisfy that school’s particular 
or communalities and rotations. А РР ; 
The purpose of t paper is to explore the extent to which the indirect analysis 
Of the scores ia Түш шушы, to a direct analysis of the type (1). Part of this problem 
notaheen Studied previously—from a somewhat different point of view ас other writers, 
en ably E, B, Wilson, Godfrey Thomson, and Walter Ledermann [see discussion and refer- 
a CES in 15, pp. 371 f] In the present paper, we shall establish the existence of solutions to 
J given (2), а ida set of and sufficient conditions for the construction of all possible 
Solut; ‚апа a set of necessary a d examine in detail some basic questions 


This will enable us to go further an 


ncernin ap : = lysis. 
8 the meaning of common-factor analysis. 
و‎ TUNE: The Six Problems. The reason why (2) has entered 


t ¬ Sufficiency Versus Necessity: UE 
= Picture of factor AlS is E itis a necessary condition for (1). If A, y, and ¢ are 
unien Such that (1) is satisfied, if L and U? are the covariance matrices of the common- and 
eachue-factor Scores respectively and if the unique-factor scores have zero correlations with 
other and with all the common-factors, then (2) necessarily follows. 
ted grant-in-aid from the Behavioral Sciences 
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The Determinacy of Factor Score Matrices 


Necessity is not the same thing as sufficiency. Computations concerned only with (2) 
yield only matrices А, L, and U?. To reach the goal of the factor analysis requires one to 
ask further: does there exist a pair of score matrices, 7 and £, to satisfy (1) with the given A 
and with the covariances specified by the given L and U? ? If yes, how many such pairs 
of score matrices exist for the fixed А, L, and U?? That is, to what extent is (2) sufficient 
for establishing a solution to (1)? 

This existence problem turns out always to have an affirmative answer, but also to have 
too many affirmative answers, especially when U? + 0. Furthermore, for finite л, these alter- 
native answers can differ widely among themselves. We shall establish conditions under 
which (when № = oo) there will be essentially but one answer in the limit as л 一 o0. 

The multiplicity of solutions for 7 and Z when л is finite—even when A, L, and U* are 
fixed—has important implications for the psychological meaning of common-factor analysis, 
as well as for the computing procedures conventionally used. If we denote as * Problem I 
the determinacy of 7 and £, solving Problem I throws light on the five further basic problems 
listed below. These are discussed more fully in the last part of the paper. 


Problem П. Communalities. The problem of the * unknown communalities '—or equivalently, of 
the choice of U2—turns out to be related to Problem I. We show a unique solution is definable 
under certain conditions as n — ©. 

Problem Ш. Meaning of factors. It has been thought by many that the meaning of common- 
factor scoresin 7 could be ascertained from 4 in (1), or factor variables in could be named according 
to the observed variables in £ which have high loadings on them. This may be illusory in the light 
of the answer to Problem I, for 7 itself can have widely different simultaneous solutions no matter 
what the structure of a fixed 4 may be. 

Problem IV. ‘Inverted’ factor analysis. Merely considering the case where N = co, as for example 
when the variables in £ are continuous, raises questions, which may not have been previously apparent, 
in respect of the proposal to ‘ factor’ people rather than variables. 

Problem V. ° Second-order’ common factors. When L in (2) is not a diagonal matrix, some Te; 
searchers have proposed * factoring’ it in turnlike R. Solving Problem I shows that ‘ second-order 
scores may be even more indeterminate than 7 and ¢, especially when the number of common- 
factors is small, so their use in the quest for parsimony may be quite illusory. А 
Problem VI. Rotation of axes. If т) by itself is indeterminate, any rotation into an 7 remain 
as indeterminate, no matter what A may result from 4. Seeking meaningful factor scores merely РУ 
a correlational analysis, as implied by rotations, may be fruitless; for this further observations o 
experiments beyond £ and R may be essential. Since many fundamental and useful properties О T 
common-factor space can be shown to exist and can be studied without specifying a particular Жү 
reference axes [8, 9, 11], the rotation problem may have diverted attention from what can rea y 
be learned by a correlational analysis alone. 


We shall actually solve Problem I for the case where the covariance matrix U* is not 
necessarily diagonal, or where the so-called B-law of deviation holds [9, p. 308]. The 1? 
striction of U? to a diagonal matrix turns out not to affect the existence and determinacy 
of possible solutions. Of our six problems, only Problem II is devoted exclusively to t 
special case of a diagonal U?. 

The mathematics of factor analysis has usually been cast in terms of matrix algebra. 
More generally appropriate is the linear algebra of an abstract Euclidean vector space W 1 
we shall adopt in this paper. 


s just 


II. THE PROBLEM OF DETERMINACY 
Se 


3. Notation for Score Matrices. Let x; denote the jth variable selected from the unive d 
of content, and x; the observed score of individuali on ху. It has been customary to rego. 
the ху as elements of a finite real matrix of order nx №. However, since this тр 
that the observed variables are discrete for the given population, we shall prefer instea г 
use a notation that will allow for both the discrete and the continuous cases, for finite and E 
infinite N (countable or not countable). To this end, we shall define the score matrix ап 
its transpose £' to be respectively , 


X1 3 
geo, E^ = p м i 


Xn 
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Rd is, € is a single column with statistical variables as elements, and £' is the same elements 
119 M asarow. We define the inner-product of two elements x, and x; to be their covariance, 
" enote this by x,x;. While neither x, nor x; is a real number, their product x,x; is. 
particular, the variance of x; is XX 
ae With this convention as to inner-products, we can form the product ££' by the usual 
d of matrix multiplication. Clearly, this is a real covariance matrix of order п; we shall 
enote it by R: : 


| R= & = [х„х;]. (4) 
It is customary to assume all observed variances to be equal to 1: 
x= 1 G = 1, 2,..., n). (5а) 


In this сазе, R in (4) is the correlation matrix for the variables in £. Convention (52) 


1 Н = * Py - 
JS not essential to this paper; but we desire to restrict ourselves to the case where no diagonal 


clement of R vanishes or all observed variances are positive: 
xx; > 0 G= 1, 2,...5 п). (5 b) 


dius (Sa) can always be attained when (5b) holds (by dividing through by standard 
or iations), the reader is at liberty to interpret A in this paper either as a correlation matrix 
Merely as a covariance matrix, according as he wishes to adopt (5 a) or only (5 b). 


Е consequence—though not the most important—of our inner-product convention is that the 
no es of measurement of the х; need not be specified ; the means are arbitrary. There is, however, 
EE R generality in assuming the means to be zero for the problem of the present paper. 
eleme is finite, a score matrix such as £ in (3) can be regarded as a partitioned real matrix, each 
is the nt representing a row of N real numbers. Our rule for post-multiplication of score matrices 
by X Same as the usual one for partitioned real matrices, except that the usual result is divided through 
modif nec we also wish to deal with variances and covariances for Nes co, the usefulness of our 
of Prod Post-multiplication rule will be apparent; it allows for general integrals (Stieltjes-Lebesgue) 
conve iud as wellas for ordinary product-sums. Thisis the basic motivation for our inner-product 
Certain ion. Here we shall always consider п finite. Although we shall also be interested in 
n limits as п co, any given value of n is finite. 

of 4. The Vector Space Concept. When dealing with score matrices, in view of our definition 
an inner-product, not all rules for real matrices (i.e., matrices with real numbers as ele- 


ments) need apply. 
For 72 
finit; example, we d t define the product ££; but we s 1 
nition. On the other Pand, if M is aay теа! matrix of order mx n, then the product Mê is defined 


Y the : J n Ae 

Ordinary | i Id lumn score matrix of m elements; linear 
с ау duct to yield a new column s z 
ni pou EE manx pro ў, are two column score matrices of n 


binati : s den д 
el ions of variables yield variables. Similarly, if œ and £ co ; 5 
T ents each, then the b eif is formed by the ordinary law for addition of matrices to yield a 
Scor Solumn score matrix of n elements, for the sum of two variables is a variable. In short, column 
таге matrices are to be pre-multiplied only by real matrices and post-multiplied only by row score 
m T (in the latter case, the product being а real matrix of covariances). The remaining rules of 
Manipulations will hold for the needs of this paper. 

$m In effect, we are regarding our statistical variables as points or elements of a function 
се (cf. 13, chap. V]. Such a space is a special case of a Euclidean vector space [1, p. 189]. 
vam (length) of each of our elements is taken as its standard deviation, and the inner- 
m of any two of our elements is their covariance. аР — 
The a| opriate to any finite dimensional (n finite 

Eu lid lgebra we need turns out to be completely арры ODIO: the algebra as for the general 
Lemmas and theorems will usually 

When we wish to analyse the 
итп of n vectors. For our statistical case, 


: Ca: н T 
$ is se, we shall speak, for example, ipie im "The adaptability of the notation of matrix 
згч t 一 to columns of vectors from any Euclidean 


he same exten 
tion usually adopted for Euclidean vector spaces 
hezo vector, denoted by 0, is meant that vector such 
The convention is usually made that all vectors with 
fer to denote a vector with a zero norm by 0*, and 
tatistical case, a Zero norm means a zere standard 
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hall also have no need for such a de- 


Concerns departure we shall make from a € 
that 45 the concept of a zero vector. By t 
Sero norme ,* for all vectors x in the Space. 
Allow it zs Shall be set equal to 0. We shall pre! 

9 be possibly different from 0. In our $ 
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deviation, or a variable 0* is constant almost everywhere (i.e., for almost all individuals), or equals 
zero almost everywhere if the mean is zero. It may be important in some cases to distinguish between 
a variable which is identically zero and one that merely has a zero variance, and we shall leave room 
for this distinction. 

A vector with a zero norm will be called here a null vector. Clearly the vector 0 is a special case 
of a null vector. What all null vectors 0* have in common with the vector 0 is, that if x is any vector, 
then the inner-product of x with 0* vanishes: 0*x’ = 0. This follows from Schwarz's inequality 
[1, р. 190]. A column of null vectors will be denoted below by 0. 

А distinction between a Euclidean space and a Euclidean vector space is that in the former there 
is but one null vector, namely 0. As is customary, we shall use the symbol 0 to denote several 
different things: the real number zero, the zero vector, a zero matrix of real numbers, and a column 
of zero vectors. The context should always make clear which of these is implied. 


5. The Basic Hypotheses of Factor Analysis. Given an observed score matrix £, part 
of the basic hypothesis of the Spearman-Thurstone factor theory is that hypothetical score 
matrices 7 and ё exist, as well as a real weight matrix A, such that: each y; of has a correla- 
tion of zero with each z; of б, or 

16 = 0, (6) 
and (1) is satisfied. 

If A is of order 1 q, then 7 is composed of q variables y, which are called common- 
factors. б necessarily has n elements z; in (1), although some of these may have zero 
variances; these п variables we shall call here deviant-factors. And, following a terminology 
suggested in [9, р. 308], we shall say that any 7 and £ satisfying (6) obey the -law of deviation. 

The covariance matrices of the g common-factors and the л deviant-factors will be de- 
noted by L and Us respectively: 

L=m, U? = ir @ 

The Spearman-Thurstone theory requires further that ¢ follow the y-/aw of deviation, 
in the terminology of [9]. This means that z,z; = 0 whenever g + j, or U? in (7) is à dia- 
gonal matrix. In such a case, if we denote the jth main diagonal element of U? by 1, Wê 
can use the conventional notation for a diagonal matrix to write: 


(8) 


0° = [и], u$,...,ui]. 


When both the В- and y-laws are satisfied, 7 and ¢ are said to obey the 8-/aw of deviation 
[9, p. 308]. Following Thurstone, we shall then call z; and и? the unique-factor and uniqueness 
respectively of xj. 

The solution to our sufficiency Problem I turns out to be essentially the same for the 
general B-law as for the more specialized 6-law. So we shall not restrict U? necessarily 
to be a diagonal matrix. н 

If L is a diagonal matrix, then the yx are said to be orthogonal to each other; otherwise 
they are called oblique. We shall treat the general case of any L, diagonal or not. 


It has been customary to assume the variances of the у all to be equal, and hence to set them all 


equal to 1. In some recent developments [9, 11], it appears desirable to express the ук with uned e 
variances. So we shall treat the general case of arbitrary variances, and not necessarily assume 
main diagonal elements of L all to be equal, nor in particular equal to 1. 2 en- 

The case of (1) usually studied is where A is of rank q and the elements of 7 are linearly indep" n 
dent, or L is non-singular. Since no extra work turns out to be involved thereby, we shall placeder 
restrictions in general on the ranks of A and L. Of course, if the rank of A is r, then from the ОГ 
of A it must be that q > rand n > r. : ў w 
ne Post-multiplying both members of (1) through in turn by 7’ and { and using (6) and @ 

ёр = AL, él’ = U? ad 

Formulae (9) are well known for the 3-law; they hold more generally f -law ‘ 

The basic necessary condition (2) follows from Denon bak A ars of (D oy d 
and using (4) and (9). Condition (2) involves no score matrix of (1) directly. It is well know” 
the case of a diagonal U*, but holds again for the more general case of the f-law. 


: : 0 

. 6. The Existence of Solutions. Hypotheses (1) and (6) by themselves do not servo ei 

pin down q, r, or U*, and certainly not А and L.. Even when U? is restricted to being v 
diagonal matrix, this does not improve the situation. While their procedures may assu! 
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fae forms, in effect most factor analysts begin by assuming the 8-law, and seek a diagonal 
that will leave R— U* Gramian. 
i Зарро some decision оп U? has been reached, and that for this U*, the rank of А — U? 
5 5 i Factor analysts then proceed to * factor" R— U? by any of several methods, to obtain 
j Eg rix А of order n x r and of rank г (that is, q = г), and a non-singular Gramian matrix 
PE Order г, such that (9) is satisfied. The most general formulae for computing such an 
wh nd L from R= U? are given in [6, р. 12] and again in [7, p. 210]. These formulae hold 
ether U? is diagonal or not. 
to te problem is: to what extent is satisfying (2) sufficient for establishing a solution 
th (1) that is also consonant with (6) and (7)? A general answer follows immediately from 
€ following lemma. 
Гетта 1. Let£beacolumn ofn vectors of rank p, each vector having a positive norm. Let R= Ё. 


Ij 
If B and G are any real matrices such that 


wh R= ё = ВСВ' (10) 
Such 5 d js а Gramian of order g and B is of order nx g, then there exists a column ¢ of g vectors 
£= Bb, Ф С. ap 


ofa For the proof, we first notice that the rank of G in (10) cannot be less than that of R (the rank 

is Product cannot exceed that of any factor in the product), and hence must be positive. Since 

s Gramian, for any finite integer f not less than the rank of G there exists a real matrix F of order 

&Xf such that 

E FF' = С. (12) 

Siem (10) and (12), R = (ВЕУ(ВРУ, so that the rank of 
обу 48 the number of columns in BF, clearly f > р. 

uii dan vector space of f dimensions. Let wy be the column of, 
Or this f-space; such a basis always exists (cf. 1, p. 193). 


Ris that of BF, or BF must be of rank p. 
Then £ can be regarded as imbedded in a 
‘f vectors that form an orthonormal 
Such Then there is some real matrix В, 
= that ¢ = Bye, But since «c; = 1; (the identity matrix of order f), the last equality implies 
orth B,B, BF and В, both being ‘factor’ matrices of Rand of the same order, there exists a real 
then sonal matrix W (of order f) such that В, = BFW [4, p. 71]. Therefore, if we let ¢ = FWwr, 
or th — Bro, = £, or the first part of (1D is satisfied. Also $$’ = (FWo)(FWor) = FF’ = G, 
he second part of (11) is satisfied. Hence, а Ф always exists for (11) and Lemma 1 is established. 


As an immediate consequence of Lemma 1 we have: 
are Фіеоғет І. ЈА, 1, dnd U? are given such that R= ع‎ = ALA’ t where L and U: 
W eo raian and no diagonal element of R vanishes, then. there exist two coi umns of vectors, 
10 ра L, such that ë = Аз and qy = L^ 6 = U*. and nf = 0: that is, a solution 
roblem J always exists. 
he proof consists of defining B and G in Lemma 1 by: 


LO 
B= 141.1, G= LEA (13) 
» is the identit trix of order л, By direct multiplication, it is seen that BGB' — 
LA’ Ue, Therefore, pem Lemma 1 there exists a Ф to satisfy (11). and (13). If L 
ay Order q and U? of order n, then g = 9+" Let 7 be the first q elements of $, апа ё the 
n elements. Then | 


Where 7 
is o 
l 


_flall ae ||", all (14) 
" «Jl в = |р к 
ut ddr. ; f (13) with that of (14) shows that 
r =G fr aring the second part of (2 V А 
7 and ¢ satisfy (6) ч зо Co ürthermore, by direct multiplication according to the first 
isxts of (13) and (14) B$ a А+ or—using the first part of (11)—we have verified that (1) 
atisfied. This establishes the existence of n and ё for Theorem 1. | 
ең Theorem 1 generalizes Theorem С of [7] to infinite N and to the general Euclidean 
Or space, and to the case where £ is not regarded as fixed in advance. 
It is ofj mmn here are and how they are interrelated. 
Fro pee E inquire further hon that cannot be uniquely determined if the rank of G 
me CAS the rank c f Emma: Oi dingly,in Theorem 1, if the number of linearly independent ele- 
Mts in a pair бгз m FOE ET (the rank of R), more than one pair of solutions exist. 
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Results of considerable importance are obtained by looking closely into the cases where p is 
less than the sum of the ranks of L and U?. This we shall do by constructing all possible solutions 
to Problem I. An incidental consequence will be an alternative proof of the statements in the pre- 
ceding paragraph. More important, detailed information will be made available as to the relation- 
ship among alternative solutions ana the implication these have for the foundations of common- 
factor theory. The Spearman-Thurstone theory is an important example of where p is less than the 
sum of the ranks of L and U*. 


7. A General Construction Lemma. Lemma | asserts the existence of a solution ф for 
(11), given (10). We should now like to show how all such solutions can be constructed. 
This is done in Lemma 2. 


Lemma 2. A necessary and sufficient condition that ¢ satisfy (11) in Lemma 1 is that it be of the 


їн ¢ = Кё-+-Со=-+-8, (15) 
where К is a real matrix of order g X n and satisfies (16): 
KR — GB': (16) 
C is of the order g x f, as defined by (17): 
C = (1,— КВ)Е, (17) 
Е being also of order gX f and satisfying (12); w is a column of f orthonormal vectors satisfying (18): 
Co£' — 0, wo’ = 1,; (18) 


and 0 is a column of g null vectors satisfying: 
B0 = 0, 00’ = 0. (19? 


The proof revolves around the properties of K and the products BK£ and BC. " 
That a solution K to (16) always exists when A is non-singular is obvious, for then K is uniquely 


determined as 
K= GB'R^ (|8|>0) (20) 


When R issingular, we can lean on the theory of least-squares to see that there is not only one solution 
K to(16), but infinitely many. For Lemma | assures us that at least one ¢ exists that satisfies (11). 
If we expand thel eft number of (21) and notice from (11) that £¢’ = Bé¢’ = BG, then we see that 
is equivalent to (16): 

@—КӘё = 0. QD 


For function space, (21) is precisely the set of normal equations for the least-squares predictions 
of the variables in ¢ from the variables in £, and it is well known that a best set of regression weights 
K always exists, although not uniquely determined when ££' is singular [13, pp. 151 f.]. The proo 
in [13] is given as for a function space, but clearly holds for any Euclidean vector space. 
. . Equality (21) reveals the * statistical’ meaning of the proposed construction of ¢ in (15). KE 
is the * least-squares’ estimate of ¢ from £, and we have to show that Cw+-0 is always the matrix 
of errors of estimate. it i 
That an o always exists to satisfy (18) should be evident. Indeed, for the first part of (18) 115 
sufficient to have «£^ = 0. When our vectors are all statistical variables, w can be defined by throwing 
dice or turning a roulette wheel and making the variances all equal to 1. Or сап be taken aS an 
orthonormalization of any set of f linearly independent scores that are uncorrelated with those 0! $ 
That a 0 always exists for(19) is obvious. For example, choose 0 = 0. This is the only chor 
possible if B’B is non-singular, but there are many other possibilities when B’B is singular, especially 
ina function Spares ^ ely, 
Granted that the right member of (15) always exists, wi i iency, name» 
thatit satisfies (11). To this end, it will be helpful to eh ee hid ciae 
,,, Proposition 1. If E and M are real matrices such that MEE’ = 0, then ME = 0. similarly 
if is a column of vectors such that Mish’ = 0 and the norm of each vector in y is positive, then TM". 
The proof of the first part of the Proposition follows from the identity (MEYMEY = MEE M: 
The right member certainly vanishes when MEE’ does, and in particular its main diagonal does: 
but the sum of the elements of this main diagonal is the sum of squares of all the elements in ME, zd 
each element in ME must vanish. To prove the second part, we can let у = Eywo, where wo 15... 
orthogonal basis for the space of у. Then Му = MEE; eas 0, whence ME, = from the firs' 
part of the Proposition. Therefore ME;», = Му = 0, or the second part is established. f 
Notice the requirement in the second part of Proposition 1 that the norm of each element © 
be positive. This is to ensure that My should actually equal zero and not just be null. 


70 


iu^ Locssc л 


а‏ خخ 


L. GUTTMAN 


To proceed with the proof of sufficiency for Lemma 2, let d, and ¢» be defined respectively as 
Ф = К, $2 = Со, (22) 

So that (15) can be rewritten as 
$= ditdet 6. (23) 
Then фу is the * least-squares ’ estimate of d from ё, and ¢ is our candidate for the error of estimate. 
. Since К always exists, so does ¢;. But unlike К, ¢, is uniquely determined by (16) even when R 
t Singular. For suppose К and K* are any two solutions to (16). Since (K—K*)R = 0, Proposi- 
lon | assures us that (K— K*)£ = 0, or К*ё = Кё = ¢, even although К+ K*. Unlike Фф, $; is 
Quite arbitrary, being dependent on a rather arbitrary w; indeed ¢» is fixed if and only if C = 0. 

€gardless, we shall now show that always 


Bó,—£, Вњ = 0. (24) 
Pre-multiply (16) through by В and recall (10) to see that 
ВКК = К, (25) 


r4 (BK—L)R = 0. Therefore, from Proposition 1 (BK—Jn)f = 0 or—expanding, and recalling the 
first part of (22)—the first part of (24) holds. Similarly, if we сап show that BU,—KB)G = 0, 
20 from Proposition 1 and from (17) and (12) it will follow that ВС = 0. Expanding, we find 
aU, — KB)G — BG BKBG = RK —BKRK' = RK'—RK' = 0, the third member following from 
(16) and the fourth member from (25). Therefore BC = 0. Certainly then BCw = 0 for any w, 


9r the second p; 

part of (24) always holds. А 

the c. he proof of (Sa) could bea bit shorter for non-singular R for then BK = In from (25). It was 
ase of singular R that required us to invoke Proposition 1. 9 Е 

е сап now реч йр (23) through by В and use (24) and (19) to obtain Bó — £. This 


veri; h H " 
sone A, constructed according to (1) must sari he must now see about the second 
From (22) and the first part of (18) it follows that 
didi = 0. (26) 
Post-multiplying each member of (23) by its own transpose and using (26) and (19) yield 
Post. | gp = didit $$ | (27) 
“multiplying the first part of (22) by its own transpose yields 
didi = KRK’. (28) 
Post-multiplying the second part of (22) by its own transpose and remembering (17) and (12) yield 
dubs = CC’ = (I, KB)GU,— B'K). (29) 
Post-multiplying (16) through by K' shows that A 


th KRK = GB'K' = KBG, dn 
last mem i i ing the transpose of the middle member as is justified by the 
Sy mmetry of ber Sero ned Lele the last member of (29) and using (30) and (10) yield 


CC’ = G—KRK’. 
Therefor, 


completed from (27), (28), (29) and (31) we see that the second part of (11 
ie proof of the sufficiency of (15) TE UN Fey is now relatively rapid. Let ¢ be any 
(olution i Latin on Ше dar a OD). As we have seen, ¢, is ane m no daa 
sarily pud G and dus is always fixed for our problem. It remains to be shown ü 
of th ja | | 
“хра ost-multpiving © through by K' and using the definition of $, in (22), and then in turn 
"ding the resulting left member, show that 3 


(4—4091— 0, Ф = didi = hb. 


(31) 
) is satisfied, or we have 


H : 
nce, expansion of the left member of (33) and use a ie лы A 
二 3 
the la 人 一 知人 一 向 hen from (33) and (31), 
st member following by recalling (I1}and (28). Then ү (34) 


= J) = cc’. 
($—$ : " 
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If any main diagonal element of CC' vanishes, so must the entire corresponding row of C. Let 
c be the number of non-vanishing rows of C, let C, be the non-vanishing c rows of C, and let (ф— Ф): 
be the c vectors of (6—¢,) with positive norms. Then from Lemma 1, there is an orthonormal we 
such that (6— 4). = Су, [in Lemma 1 replace £ by (6—¢1)e, B by Ce G by I,, and ¢ by we]. Let 8 
be a column of vectors which is the same as ф— ¢, for the g— c null elements, but with zero elements 
elsewhere. Thus, 8 satisfies the second part of (19). Let w be any column of f orthonormal vectors 
containing we and arranged so we can write 


@—Ф) = Cu+0. G5 


This can be done, for Cw contains Со, and only zeros elsewhere by virtue of the vanishing rows of C. 
Hence $— Ф, must be precisely of the form implied for ¢, 0 in (23). х 
One detail not explicitly pointed out yet is that Со here necessarily satisfies the first part of (18); 
but thisis precisely what (21) implies. A final detail is that @ must satisfy the first part of (19). This 
follows by pre-multiplying (35) through by В and using (11), (24), and (22). 
The proof is thus complete as to the necessity as well sufficiency of (15) for (11), or Lemma 2 
has been established. 


8. The Construction of all Solutions to Problem |. Definitions (13) for B and G and 
notation (14) for ф transform Lemma 2 into: А 

Theorem 2. А necessary and sufficient condition that y and © satisfy Theorem 1 is that 
they be of the form: 


л = K£-Po--0, {= KE—APo— А6 G6 
where K, and К, are of orders q X n and nx n respectively and satisfy: 
K,R = LA’, K,R = U?; (37) 
P is of order q x f and satisfies: 
PP’ = L—K,RK3; (38) 
w is a column of f orthonormal vectors satisfying: 
Po£' — 0, ww’ = I, (39) 


and 6 is a column of q null vectors (00 = 0). 

Conditions (37) constitute but a direct restatement of (16), letting K, and K, be the first d 
and last n rows of K respectively. In (38), P is the first д rows of C in (17); the right member 
of (38) is the Gramian submatrix of the first д rows and columns of the right member of ( ). 
The last n rows of C must then be equal to — AP, from the fact that ВС = 0 and from the 
definition of B in (13). We have used this dependence of the last rows of C on the first @ 
in writing the coefficient of w for С in (36). Similarly, the @ in (36) is the first q elements 0 
the 0 of (15), while — 40 in (36) is the last л elements of the 0 of (15), according to the first 
part of (19); thus in Theorem 2, the analogue of the first part of (19) always holds and nee 
not be stated explicitly. Orthonormal w is the same in (39) as in (18), and clearly the firs 
part of (39) is equivalent to that of (18) from the dependence now of the last » rows 0! ©? 
namely AP, on the first q, or on P. 

Thus Theorem 2 is but a restatement of Lemma 2 for the special case where 
are of the form (13), and its proof has been completed. in 

If P ¥ 0 in (36), clearly infinitely many solutions 7 can be constructed by (36), and ! 
general also infinitely many solutions С, by virtue of the arbitrariness of w. But it 15 also 
clear that the extent to which these solutions can differ among themselves, as measure 
by the norms of their differences, should be limited by how close the main diagonal elements 
of PP’ are to zero. The precise nature of this limitation will be established next. 


9. The Maximal Difference among Alternative Solutions. Given that ф is one solutio? 
to (11), let us seek another solution, ф*, to (11) that is maximally different from ¢ in the sens 
that the norm of each vector in the difference ф—ф* isa maximum. From (23), we can write 
$* = pitz +0*, where ¢, is the same as for 办 but фЁ--0* is specific to ф*. Then pz 
is the same as ф«—ф: except for the null column 0—0*, so our problem is to max! 
the norms of the elements of $;—4$2. These norms are the main diagonal elements of 


(4.—42)0.—05) = фара фаб —diji- didi. à 
Since paps = a= $’ CC’, only Pap’ and ¢$4:—which have identical corresponding 
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al elernents— esi vary with ¢* in the right of (40). Our problem reduces to maxi- 
since ф m maip diagonal elements of — фф. From Schwarz's inequality [1, p. 190], 
element E p3 have the same norms for corresponding elements, each main diagonal 
Сау Bi —ф+ф:' cannot be larger than the corresponding diagonal element of ф+ф,. 
ا‎ is maximum is attainable for all diagonal elements simultaneously by letting 
4, E verify that —ф can actually constitute a column of errors of estimate, notice that if 
m € in (15), then we can write $3 = С(— о), and —w satisfies (18) just as well as w 
ES the According to Lemma 2, our maximally different ф* is as legitimate a solution to (11) 
given ф is. 


{лее results can be summarized as: 
тта 3. If ¢ is a solution to (11) in Lemma 1, then there exists a $* that is maximally different 


from У 
а و‎ maximal squares of the norms of the difference column $—4$* being the main diagona 


7 ($—4*)(6—9*) = 4CC', (41) 
B s as given in (29) and (31). 
holds le right number of (41) comes from (29) and the fact that $;—$1— 29:. 
Since ¢—¢* equals $,—4£ up to a null column. 
Returning to Problem 1, Lemma 3 leads to: 


Theorem3. If x and ¢ are a pair of solutions for Theorem 


The equality in (41) 


1, then there exists a maximally 


di Á 
ifferent pair of solutions, n* and &*, such that: 
(5—79(—75*) = 4PP' = 4(L— K,RK;) (42) 
(Citi = 4APP'A = 4(U*— К.КК.), (43) 


Where the real matrices of (42) and (43) are as defined in Theorem 2. . 
he equalities in (42) and (43) are merely expansions of (41) for case (13), using 


entities established for Theorem 2. | . р 
or our application to statistical variables, PP’ is the covariance matrix of the errors 
ce matrix of the errors 


9f estimat ; i i 
€ of » fi / 4’ is the corresponding covarian f 
: den E O dn j a, the maximal variance of the 


Suspect that £ will have to afford very close estimates of a given pair 7 and £ if all other 
i terially from this given one. Such a suspicion 


the next section show. 
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Table 1. The Minimal Correlation (p*) always attainable between Two Alternative Solutions for 
the Same Factor (Common or Deviant), as a Function of the Multiple Correlation (p) of that Factor 
on the Observed Scores. 


P :00 -30 60 -80 90 -95 97 99 
p* —100  —82 —2 -28 .62 -81 -88 -96 


To ensure any positive correlation at all between y and y* (z and z*), the multiple correlation 
of y (z) оп ê must exceed +71. But Table 1 reveals that even when p reaches -80, the corresponding р 
is only “28. The situation improves, but not too much, when a p of -90 is reached. Only when a p 
as large as .99 is attained do the alternative solutions necessarily begin to correlate by as much as 
*96 with each other. 

In a previously published numerical example where q = 3 and л = 7, it turned out that the p 
were -90, -89, and -88 respectively for the elements of 7, while they ranged from :59 to :92 for the 
elements of [5, pp. 182 f.]. It should now be evident that having determined 0°, A, and L for such 
data does not go too far towards fixing any corresponding score matrices ; and &. — р 

Godfrey Thomson reported that the p for nine * primary traits" in Thurstone's original analysis 
range from -630 to .908, and remarked that: ‘ Those correlations do not look so bad ’ [15, р. 339]. 
If we look at these p again from the point of view of p*, it seems that the sought-for traits are not very 
distinguishable from radically different possible alternative traits for the identical factor loadings. 


11. The Case of Non-Singular R. When R is non-singular, some formulae above can be 
conveniently expressed in terms of R^. From (37), we have K, and К, uniquely defined by 
К, = ГАК, Ka es URS (45) 

If we let 7; and £, be the predicted values of т and ¢ respectively from £, and 7; and Ča the 
corresponding errors of prediction up to a null column, then from (36) and (45) we can write: 


m = LA'RA£, G = URE, (46) 
ть = Po, a = —APw. (47) 
From (46) and (4), we find 
qmi = LAR AL, GG = URU. (48) 
Now, from (27), 
an = nia. (49) 


Then, from (49) and the first parts of (7), (47), and (48), we have the identity: 


aan! = PP’ = L—LA'R? AL. ad 
Similarly, since 
Ww = bU bU, en 
we have the identity 
GU = 4PP = Ut URU, (52) 


gs An interesting result of (52) is that, if U? is non-singular, we derive an identity for 
| R = U?—U-:APP'A'UA, en 
This is obtained by pre- and post-multiplying (52) through by U-?. 


2 
12. The Case where L and U* are Non-Singular: The Matrix Q. If both L and EA 
are non-singular, further interesting and useful special identities emerge. First, let us noti 
the important fact that U? non-singular implies R non-singular, which may be worth stating 
as a theorem: 


Theorem 3. If К, U?, and R— U? are each Gramian, and if U? is non-singular, then KR 
non-singular. : 

This follows from the theorem that the rank of sum of two Gramian matrices canno, 
be smaller than the rank of either term in the sum (4, p. 73], and clearly R = (R-U) ` 
In our case, R— U* = ALA’, according to (2). 
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Notice that L non-sin i i 
à 1 -singular by itself says nothing at all about the rank of R, but the 
ш ALA’ does (being a lower bound to the rank of R). 
eal ince we now assume both L and U? to possess inverses, we are at liberty to define a 
Symmetric matrix Q of order q by: 
- Q = L4- A'U^4A. (54) 
ith this definition, we immediately establish the following theorem: 
Theorem 4. The matrix Q as defined in (54) is Gramian and non-singular. Furthermore, 
1 Q^ = PP’ = L-LA'R?AL, t (55) 
Wi P 
tere Р is as defined in (38), or Q^ is the covariance matrix of the estimates of the common- 


4 is to multiply the right member of 


Perhaps as easy a way as any to prove Theorem | mul 
duct (in either order) reduces to Iq, 


(54) by the 
， last member of (55) and see that the pro 
using the fact that ALA' — Lu Р 


(53) That the inverse of R in the right of (55) exists is assured by Theorem 3. Indeed, we can re-write 


Ident Ro = U-—U-*AQ^A'U-. (56) 
and ШУ (56) was established before in [3, рр. 91 f.] for the special case of the 8-law (U* diagonal) 
y ju here L-— [, and q = r. For that case, it often serves as an economical way of inverting К 
for 7 "ting a usually much smaller matrix О [5]. The identity holds more generally for the B-law, 
т « and for q4- r, although it need not lead to parsimonious calculations in the general case. 
ident multiplying (56) through by L4’ and recalling (45) and (54% establishes a further useful 
(57) 


F K, = LA'R = Q^4'U^. 
Or the 8-law a. Д ; i in [3] and [14], bei 

8 nd = = г, (57) was established previously in [3] and [14], being 
Beneralizeq to L+ Lin LOS zur d (RS Оу further generalized it to the £-law and 4+ г. 
nother way of arriving at (57) is to note from (26) and (14) that 


лїї = 0. 


Hen, 
Se from (58), (45), (46), and (47), 
Fo K,U? = LA'R’ U? = PP'A'. (59) 
T 4 и ) 
singu only the assumption that R- exists is needed. If in addition both U* and L are non 


g Ме can immediately derive (57) from (59), using (55). 
error covariance matrices can now be Written as 60) 


accordi тт = Q^ DU = AQA’, А iem 
or Bing о 0), (52) and (5). No diagonal element of D ream vanhat For tne rank ОГ dO 
fh ran malari, at eat diagonal бетеп o, of р. 7}, Since these diagonal comen 
Variances of the errors of prediction from ё, we can state: 
L and U* are non-singular, all common-factors and at 

ces of errors of. ‘prediction frm. | x 
remark that identities for the covariance matrices 
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In developing a psychological theory, any set of observed variables to be factored will ordinarily 
be regarded as but a sample from an infinitely large universe of content. Since we now wish to study 
what may happen as n — ©, we can no longer in general consider N to be possibly finite. Of special 
interest is where R is non-singular for all z. In order for R to be non-singular in (4), clearly it is 
necessary that N be not less than п. (Indeed, if the linear restriction is laid on the xj; that their means 
over i be zero, we must have N—1 > n.) 

In any event, finite N is not the case of greatest interest. Not only does this restrict л to being 
finite for non-singular R, but it prevents the x; from being continuous random variables (in particular, 
from having a theoretical normal distribution). In a quest for scientifically meaningful factors, 
finite N occurs usually for but a sample from an infinite population. Sampling problems cannot 
be studied profitably until the population problems are defined. Sampling considerations are 
secondary to the basic problems with which we are concerned here. Our problems remain even 1 
there is no sampling error due to people, and do not arise because of such error. 


From now on we hypothesize that N = co; our basic population may be countably 
or not countably infinite. It was largely for this purpose that the conventions about score 
matrices were introduced in §3 above. We now wish to consider what may happen as 
n increases. 

In the general case, q and r are functions of л, so we shall write д, and rs (п = 1, 2,..). 
If U2, A, апа L are determined for a given л, the elements of these matrices may possibly 
have to be modified—apart from the enlargement of their orders with л, gw and r,—to keep 
R— U? Gramian as n increases. (In the special case where д, and r, can remain constant 
for all n sufficiently large, say for all n > n’, then the L determined for n = n’ can possibly 
remain constant thereafter, as can also the first л” rows and columns of U* and the elements 
in the first n’ rows of A.) 

To emphasize their dependence on л, we shall now attach the subscript п also to the 
matrices defined previously in this paper. In the case of the column score matrix £, of the 
first л observed variables, the dependence on z takes the form only of enlarging the number 
of variables in the matrix and not in changing those already in the matrix for a smaller value 
ofn. Similarly, the observed intercorrelation matrix R, depends on л only in that the number 
of rows and columns increases with 7; but once an element occurs for a given л, it stays fixe 
in value as л increases. Not so for A;'; here each element changes value in general as 7 
increases [cf. 8, p. 281]. In consequence, each element of K,, and K,, in (45) may change 
with п, as well as of P, in (50) and (52). Correspondingly, 7, and $, may have each element 
vary as n increases, according to the construction in (36). In particular, each of the elements 
of each of the column factor components may vary: qin, Nans бал, Cane | 


14. Conditions for Determinacy. What we have to consider in place of (2) isa sequence 
of equalities: 


Ry = AnLnA3+ U}? = 1,2.) © 
for which we seek columns 7, and £, such that, in place of (1), 
En = Aus C, (r= 1,2,-9 (63) 


and (6) and (7) are satisfied for each л (or better, for all zz sufficiently large). чр the 
We shall say that a factor score (element of a column score matrix) associated with t 
sequence (62) is determinate if and only if lim p, = 1 for this factor. Clearly, а necessary 
а ой no 
and sufficient condition for the kth common-factor to be determinate is that the kth diagonal 
element of тл = P,P, tend to zero as п co. Similarly, a necessary and su cien, 
canamaz ror Бел denant fator to be determinate is that the jth diagonal element 9, 
ēnbon = And nf nA tend to zero asn — оо. But the vanishing of the diagonal element O' ^ 
Gramian matrix necessitates the vanishing of the entire оо та And column. The 


vanishing of all diagonal elements implie аны иса y a matris- 
Hence, the theorem: plies the vanishing of the entire Gramian 


Theorem 6. A necessary and sufficient diti T terminate 
for (62) is that the left member of (64) exist and (64) holde all common-factors be dete 
(64) 


lim P,P} = 0, * 


n 
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Th i 7 В ч 
h A corresponding condition for deviant-factors is that the left member of (65) exist and (65) 


lim A,P,P;,A; = 0. 
| | 1 lim. РА 0. (65) 
If К, is non-singular for all n sufficiently large, then (64) is equivalent to 
lim (Ln—L, An Rw AnLn) = 
Jin n( L,) — 0, (66) 
and (65) is equivalent to 
lim (U;— UR" Ui) 0. (67) 
noo 


E А : 
А La and U are non-singular for all n sufficiently large, then (64) and (65) are equivalent to 
© respective parts of (68): 


lim Q;! = 0, lim 4,074; = 0, 
Hn n> 


(68) 


where Q, is „as defined in (54) for all n sufficiently large. 
pe, equivalence of the above conditions to each other follows from (50), (52), and (60). 
Vidently, from (44), if p, = 1 then pz — 1 for the same factor. Or, from (42) and (43), 


if > : 
(68) holds then all variances of differences between alternative solutions tend to zero 


п 
T), 90. Hence Theorem 7: А 
Fieve teorem 7. А sequence (62) can lead to at most one column of determinate common- 
all а, з зау т, and at most one column of determinate deviant-factors, say boo. That is, 
native solutions for a determinate factor are equal up to null columns. 


* devis Tile Case of Diagonal U} (the 8-law). Although we have used the terms * common ' and 
ie pant ` for the Eae t conte nothing asic in the algebra thus far that distinguishes between 
Our al э pipes of factors, The B-law as expressed by (6) is essentially symmetric іи апа č. Indeed, 
tial m ra would be symmetric throughout if we had considered not 7 but A7 in (1) or (63). Essen- 
Ymmetry begins to occur when further restrictions beyond the £-law are laid on either 7 or 2. 
metry, aw, or combining the y-law with the B-law, leads to a particularly fundamental type of asym- 
Veer then diagonal U} becomes intimately related to the main diagonal EE in order for (67) 
to hold q^ be the jth diagonal element of Uh. If us is always positive, t = ol anly i КЕ 


hold x ment ¢ 05 
This į pis non-diagonal elements in thejth row and column of R,-? must ten 


Tor Theorem 8, If all deviant-factors are determinate and if Us is diagonal and non-singular 

7 sufficiently large, then Rz? must tend to a diagonal matrix as n > а I 

Roi Slice that Theorem 8 assumes U, to be non-singular for all п sufficiently large, so 

ù, ists for all п sufficiently large according to Theorem 3. That the main diagonal of 

as wi] ust always have a limit as п © is a consequence of multiple-correlation theory, 
ll be marked in more detail in the next section below. 

i n discussed from a point of view closely related 

pa E dont 1/08. p. 282 and pp. 294 f.), this tendency 

ееп p : Jaw structure at all to the uni- 
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te and have non-vanishing uniqueness 
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of the variance of the errors of estimating х; from the remaining n— 1 variables from the 
observed universe. Let o2, denote this variance [the * partial anti-norm ’ of x; in the ter- 
minology of (8)]. 

Inspecting the main diagonals of (67) shows that for each j: 


i 
lim us (1-2) =й (69) 


no jn 


Since a variance of errors of estimate cannot increase as the number of predictors increases, 


o}, is a monotonely decreasing function of n for all j, so there always exists lim aj, (the 
п 00 


‘total anti-norm* of xj, which we shall denote by oj. (We have used this existence 
proposition in Theorem 8; it implies that each element of the main diagonal of Кї always 
tends to a limit, finite or infinite, as » — co.) 


Considering (69) and its source in (52), we can state: 


Theorem 9. If there exists lim ug, say ui, then a necessary and sufficient condition 
no 


that the jth unique-factor associated with sequence (62) be determinate for diagonal Us is that 
either иј = 0 OF ui = о}. 


The restriction of Rn to being non-singular has been omitted in Theorem 9, for it has been shown 
in [8, p. 293] that u2, < of, even when R, is singular. [That 4, < о}, when R, is non-singular follows 
also from (52) for diagonal U2.] Hence, if c2, = 0 for any n, tt}, = 0 for that л, and Theorem 9 holds 
also for this singular case. 

According to Theorem 9, if the main diagonal of R, is to be modified by subtraction of a non- 
singular U} to leave only the contribution of common-factors in (62), the choice is limited only to 2 
U3 which will tend to whatever the main diagonal of R;! tends. This provides a solution to the 
. problem of communalities. From the point of view of determinacy of factors, the ideal choice © 
uniquenesses is oj, (j = 1, 2,...). MAUS t 

A striking feature of this solution for uniquenesses (or equivalently, for communalities) is {һа 
it lays no restrictions on the common-factor space. Nothing is said about the rank of R,— Us for 
any n, nor about the determinacy or location of the common-factors. The limit of the rank of the 
common-factor space can be finite or infinite. This may help explain why past attempts to 50 ts 
the problem of communalities for finite л have not been successful. Apparently, only lower boun 
to communalities are universally possible for finite л [cf. 10]. The pj, afford such bounds, and mus 
tend to the actual communalities if the unique-factors are to be determinate with positive uniquenesses- 


17. Implications for Problem III: the Scientific Meaning of Factor Analysis. Since 
Spearman's original analysis of the problem when gn = ғ, = 1 and U} non-singular (ti 
2,…), it has been generally recognized that factor scores cannot be estimated exactly from the 
Observed content scores. 


For example, Holzinger and Harman state: “ Since the total number of factors (both common 
and unique) exceeds the number of observed variables, the value of any particular factor for а give’ : 
individual cannot be obtained by direct solution but can only be estimated from the observed таш 
of the variables. Тһе best prediction, іп the least-square sense, is that obtained by the ordina у 
regression method” [12, р. 264]. Thurstone seems to regard this estimation problem as EE. 
a practical one; and his position is that “ the problem of appraising an individual as to each of sever? 
factors is best solved in the practical situation by using a test composite for each factor " [16, р. 5 for 

These authors are focusing on a common-factor score matrix 7,. They apparently take it 
granted not only that 7, exists, but also that it is uniquely determined by Ug, An, and Ln, even thon 
it can be only approximated as a linear function of the variables in ёл. They do not discuss what 
be said about 7, beyond its predictability from ES. T " his 

Thomson has gone more intensively into this problem, and Ledermann has generalized 
conclusions. Assuming one pair of solutions yn and fn exists. they show how further solutions hile 
be obtained from this pair for the same А, and U2, assuming L, = /,_ [cf 15, pp. 371 fJ w d 
recognizing that “ usually the accuracy is very low indeed ” for predicting factors from the observe 
fn [15, p. 336], Thomson has not considered the relationship of p* to p as was done in $$ 9-10 аро 2 
As remarked above, Thomson considered that p between -630 and -908 “ do not look so bac * 
whereas from (44) this implies p* between —-206 and *650. to 

Jt seems safe to say that there has been considerable complacency about being able he 
ascribe particular scientific meaning to 7, and Z, on the basis of A,, L,, and Un, even if 
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factors are not determined closely for the given т. It has been especially true with respect 
to common-factors that they have been named according to the content of the observed 
variables that have * high ’ loadings on them in An. 

Thurstone has further insisted that “ it is a fundamental criterion of a valid method 
of isolating primary abilities that the weights of the primary abilities for a test must remain 
invariant when it is moved from one test battery to another test battery ” [16, p. 361, italics his]. 
е again is expressed the belief that A, delimits some particular y, with no reference to 

€ p, Or px. 

It арсы from the relation of p? to p; (for each intended factor) that the predictability 
of factors from £, is not merely a practical problem. If p is low, it raises the question of 
what it is that is being estimated in the first place; instead of only one * primary trait ’ there 
аге many widely different variables associated with a given profile of loadings. If the 
Scientific usefulness of factor analysis is to rest on its finding factor scores that can serve as 
meaningful reference axes, surely these reference axes should be defined by the analysis as 

ing distinct from alternative axes. -— Ы 

No matter how well an A, satisfies a given criterion of rotation, and no matter how 
constant some of its rows may be from test battery to test battery, а particular set of meaning- 
ful reference axes is still not defined if p; does not tend to unity for these axes. се 

in practice p, does not get to be well above :90, say, for most of the factors, so that рп is 


generally above -60, it i eaningful reference axes have been established by 
А it is hard to see that any meaning fere i е 
Somputi 1 а arithmetical criteria and manipulations аге used. — 
puting An, La, and U3, regardless of what hether in principle a given universe of data admits 


Attention may be focused on the problem wl un i a 

огап порна detonate set of common- and Деу ЫЕ О Ermpikical evidence г scanty 
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fae i i determinacy. I 
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t theory, p^ DIS e md experimental approach to the hypothesized peters po rega 
50 151 and verify their nature. If more direct observations on the 7, an have to be discarded for 
Atistical analysis of А, and £s, the Spearman-Thurstone approach may 


of determinacy of its factor scores. 


«e 18. Problem IV: * Inverted’ Factor Analysis. The proposal rs on peen d 
[actor * people rather than variables, especially when N is finite an sma ал S IS 
chaps. ХШ and XIV]. Discussing such a proposal only for finite N and n = ЗЧ ae $ 
De problem of determinacy of factor scores. Artifacts due to Anien ° e mpe 
fia Certainly occur in the algebra of the usual discussions. The varia ee n > longer 
theoretically have normal distributions, as already remarked, nor any other typ 

inuous distribution. 


the үеге is nothing mathematically 

conside? ei framework is specified. 

Som 这 What happens as N — co analogously to d 

now тайы, inner-products for the Euclidean vector еркен nplied the verter solved empirical problem. 
е 

Sart out E EO a i, nite, and to regard t 


servat dec ese rather than a finite-di 


of factoring people, provided 
opiem o be infinite. One could 
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19. Problem V: Second-Order Common-Factors. The solution to Problem I has 
important implications for the concept of ‘ second-order’ factors. It has been suggested 
that when q = r and L + Jr, or oblique common-factors are used, it may sometimes be 
meaningful to * factor’ these r common-factors in turn [16, chap. ХУШ]. The procedure 
would be to seek a diagonal matrix, say D, such that L— D? would be Gramian and of rank 
less than r; say the rank would be s where s < ғ. The assumption is that thereby s * second- 
order ° common-factors and r * second-order ’ unique-factors are determined which underly 
the original r common-factor. 

Now, Problem I applies to the second-order factors as well as to the first-order ones. 
If | D*| > 0, then the solution to Problem I shows that infinitely many sets of second-order 
factors will satisfy any given rotation in this ‘ second-order’ space. Only as r > oo can 
the embarrassment of too many solutions possibly disappear. 

However, advocates of second-order factors usually also advocate having r finite and 
relatively small. Requiring r to become indefinitely large would contradict a basic hypothesis 
of this school of thought. If r is to be finite, this requires either: (a) giving up the notion of 
second-order factors, or (6) hypothesizing that the second-order factors scores are in principle 
indeterminate for any given rotation of axes in the second-order space. In case (5), the 
question might be raised as to the psychological meaning or usefulness of the concept 0 
second-order factors as reference axes. 


20. Problem VI: Rotation of Axes. One of the currently outstanding questions of 
common-factor theory is that of rotation of axes. This concerns the term An in (1) 
Given that A and 7 satisfy (1), let T be any real non-singular matrix of order q and let A, 7 
and L be defined by 

A = АТУ, ў = Т Е = ТЕТ. (70) 
Then clearly 
Аў = An, (71) 


or A and 7 satisfy (1). Furthermore, 7 satisfies (6), and also (7) and (9) with A and Lin 
place of A and L. = 

7 is called a ‘rotation’ of 7, being an ‘ orthogonal rotation’ if Т is an orthogonal 
matrix. 7 is a solution to Problem I when А and L are replaced by А and L. Hence there 
are as many 7 corresponding to a given rotation matrix T as there are 7 for (1) as is. The 
same indeterminacy problem of scores holds for all rotations if only an indirect analysis 15 
made via А and 7. Since the criteria for rotations used by different schools of thought are 
in terms of the parameters or elements of А, these again ignore the determinacy problem 
of scores. Although we arrived at a solution to Problem II, or the determination о U’, 
this does not help at all to solve Problem VI or the fixing of T when А is given. 

Enlarging the perspective of Problem VI to include also Problem I suggests that perhaps 
undue emphasis has been placed on Problem VI in the past. If reference scores 7 are € 
going to be pinned down thereby anyway, according to the solution to Problem I, why bothe 
about rules for А and T? Such rules alone may not enable one to make much use of the 
implied factors, or to distinguish them from quite different factors. "E 

As remarked in $ 17, perhaps factor analysis has strained too much to try to isola 
meaningful factors by means of a correlational analysis alone. Many important and use! А 
things can be learned about the structure of and R from a correlational analysis 21010 
without recourse to reference factor scores, as radex theory has shown [9, 11]. Per io 
we should be content with this latter type of structural information until we can P 
direct observations and experiments beyond £ and R in order to determine 3. 
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ERRATA 
The edi i have been made in printing Professor Guttman's 
artic editor regrets that the following errors 22 n printing t Scale’ in the pre- 
ceding Оп “An Additive Metric from all the Principal Components 


issue of this Journal, (УШ, рр. 17-24). 
P. 22, equation (3). Instead of “8 " read “ ij". 


1 
г 一 五 foxo0. 
P.23; equation (15). The right member should read : ^ al 


| exscr ni ons. 
P. 23, equation (16). Jn the right member instead ааны = 


Р. 23, equation (20). Instead of " py" read “Po + 


P. 24, equation (22). Instead of “ру” read "pij + 
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1. Existing Attempts at Unique Factor Resolution. II. The Rationale of Proportional Profiles. 
Ir. History of Empirical Explorations of the Problem. IV. Analytical Solution for Pro- 
Portional Profiles with Orthogonal Axes. V. A Worked Example. VI. Summary. 


I. EXISTING ATTEMPTS AT UNIQUE FACTOR RESOLUTION 


“orrespo i e in nature. и ў 
pond to the presumed inherent se iS bcm fl solution were liste ч 
en earlier article [5]; six of them have been enlarged upon in a book [7] nos 2 TT ation 
tho cally the principle of criterion rotation [12], tnt got te шп asks the ЫА 
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incipio e Structure *; for in the first pl canny so pe as 
ple of parsimony; and, secondly, i Shnsists in having yielded 
Ded atism can ever be acceptable. T! nsists 1 

viachologically meaningful factors more often tha That have revea 
fa ing an impressive accumulation of comp is [8, 13, 15 M by subjectivity t 

“tor pattern from one study to another—a criterion less ane y 
nal stupid, however, be à mistake i Ep n Tt suffers from at ud b d оц. 
Comings: у е gle acceptable definition; 
8s: (I fine 

©) the ык пине dons to obtain simple structure is such og саай theoretical 
to obtain it fall far short of doing 50; ee o he Pd by electronic computers, 


Weakne. : х 
thantnesses。 The second difficulty promises d Wrigley (Quartimax) [19] 
a ], Neuhaus an experience shows that they 


nks t | 
S О the met vised by 
are ders [22], Thusine 6] and Tucker [28], though rea plemented БУКАГА, 
Howes altogether fool-proof and may still E procedure that are important and will 
Conce | 
T fact that, if a factor is to be 
ailure arises from the ДЫ 
by t pu En appreciable fraction of е pu En 
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a Will seerhible hyperplane. . This can occur first because (a) 1 d santa Es 
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stuff’ [7], e.g., by using the * personality sphere’ concept [6] or some similar principle 
of variable sampling: this is an avoidable fault of design, though judging by its frequency 
it is hard to escape. Secondly, it may occur because (b) there exist factors which are indeed 
so pervasive and ubiquitous (like g in the cognitive field, or surgency-desurgency in the field 
of personality) that no clear hyperplane can ever be found for them. The converse but equally 
disconcerting finding is that in опе and the same factorization, with л factors extracted, 
one can sometimes find more than л hyperplanes, so that there exist alternative resolutions. 
The senior author has found and reported such equally attractive alternatives [8, 10], though 
in a prolonged experience they have been rare. Other writers, e.g., Bargmann [1], maintain 
they are quite frequent; and, judging from the occurrence of alternative solutions by com- 
petitive psychologists, the discrimination of the better from the poorer or false hyperplane 
is not always easy. The principle now to be propounded considers real structure to be 
inherent in the data, but does not depend for resolution on the elusive hyperplane. 


II. THE RATIONALE OF PROPORTIONAL PROFILES 


The principle called * parallel proportional profiles’ was proposed in 1944 [3] as a method 
for avoiding the restrictions of simple structure. Only a brief indication of its nature there- 
fore need be given here. 


Since the assumptions and objectives of proportional profiles are intrinsically difficult to convey, 
as evidenced by the number of misunderstandings, certain points must be emphasized. The basic 
assumption is that, if a factor corresponds to some real organic unity, then from one study to another 
it will retain its pattern, simultaneously raising or lowering all its loadings according to the magnitude 
of the role of that factor under the different experimental conditions of the second study. No inorganic 
factor, a mere mathematical abstraction, would behave in this way, within the realm of orthogona 
factors. The principle suggests that every factor analytic investigation should be carried out on 3 
least two samples, under conditions differing in the extent to which the same psychological factors 
(working as independent, orthogonal influences) might be expected to be involved. We could then 
anticipate finding the * true ' factors by locating the unique rotational position (simultaneously in both 
studies) in which each factor in the first study is found to have loadings which are proportional to 
(or are some simple function of) those in the second: that is to say, a position should be discovera le 
in which the factor in the second study will have a pattern which is the same as in the first, but 15 
stepped up or down. Obviously, if no change in the relative importance of the same factors occurs 
from the first to the second experiment, no rotational position can be determined; but it is assume! 
that, either by natural influences or by deliberate experimental control, the real factor influences 
will differ beyond chance from one measurement situation to the other. 


In the original statement of this theorem [4], a general practical solution was not attemp- 
ted, and proof was limited to demonstrating (a) that proportionality of loadings cannot be 
attained by any orthogonal rotation whatever of two factor matrices (with the same number" 
of factors and variables) taken at random, i.e., that it depends on the existence of rea 
* organic ' structure in the data, and (b) that, if one position exists in which the loadings © 
factors in one matrix are proportional to those of corresponding factors in the other, then 
no other pair of orthogonal rotations exist for which this is true, i.e., there is uniqueness. 
It is proposed to show here how the solution may be generalized to any number of dimension 
and adapted to computation. To anticipate the formal mathematical development, © 5 
problem is that of finding D, a diagonal matrix containing the several distinct proportionalitle 
by which factors alter from experiment A to experiment B, so that we may write Vna = Vinh 
where Vna is the rotated position of the first factor matrix, obtained from the unrotate 
matrix Vo4 by an as yet unknown transformation Ал, and A4 is a rectangular matrix 
Thus VoAAA4 = Vna, and similarly VopAg = Vag. | f 

Before developing this, we should note that the method proposed uses the principle E 
parsimony involved in Thurstone's simple structure in а more comprehensive fas me 
In fact, it requires the principle to hold with respect to the general field of scientific obser A 
tion and inference (as, philosophically, it should) instead of merely within the universe cthe 
single experiment or matrix. It applies it to any two matrices (requiring that they yield t is 
same factors), and therefore to all. Simple structure hopes, and often finds, that W tie 
simplest in one matrix is simplest for many; but proportional profiles require that all 
phenomena analysed in the various matrices or experiments shall be explained by the sm 
number cf factors, i.e., by demonstrably similar patterns. Incidentally, the definition 
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' diverse conditions" c 1 i 
е and tech E sd TE шше, and probably with advantage, to comparisons of 
" e labour of arriving at the unique proportional profiles positi i 
Ks вре заг in two matrices would obviously be prohibitive: but е валаа 
article [4] P › ап analytical solution is possible, at least for errorless data. In the 1944 
solution, à Ше Е indicated for two factor problems. Before proceeding to the general 
Eos (a) th igression will be made on some experimental attempts to set up the problem, 
pisci) they illustrate the theoretical relations more clearly, (5) they may save others from the 
alls we encountered, and (c) they will indicate ways in which the theoretical solution still 


requi " ; ч 
quires adaptation to real conditions. 


III. HISTORY OF SOME EMPIRICAL EXPLORATIONS 
OF THE PROBLEM 


Sene dd e felt an urgent need of the method in psychological research; and the ten- 
to certai \ further publication has been due not merely to the pressure of other work but 
in misfortunes with empirical try-outs of the method. 
EAL been argued [7, 9] that factor analysis in general would benefit from more a posteriori 
situations, as in Burt's bottle problem [2] or Thurstone's box problem [25], i.e., taking experimental 
techni ns in which the * factors’ ог“ influences ' are well known and seeing how various factor analytic 
© thoques behave. In the present problem the advantages would be didactic (a) in demonstrating 
more 让 了 PSychologists who are more pragmatic or philosophical than mathematical that factors are 
Produces mathematical abstractions, (6) in showing whether the accidentally or experimentally 
Tequired changes in factor variance from sample to sample are large enough to produce the effects 
ling erro, and (c) in finding whether the blurring of simple proportionality by experimental and samp- 
тасса. addition to the systematic distortion by selection efforts) are so large as to affect the 
im ility of an error-free analytical solution. The last is the most important. | 
Strateq Es kinds of examples, additional to the completely errorless * pure principle ' case here demon- 
ut with àve been or are in process of trial and publication: (1) a box problem similar to Thurstone's, 
' pure 1 orthogonal factors and with the additional hurdle of powered relations departing from the 
(3) ге, Principle" used here; (2) a similar box problem with experimental errors introduced; and 
al and natural data, which contain both experimental and sampling error, but in which the 


Actors are cont à i 
rolled and known with tolerable certainty. A 
of en two first will be described elsewhere by Haverland [16] who has shown E the principle 
Which Portional profiles works excellently in these cases. The third is represented ni» examples 
Search Will be briefly described here and one of which is fully described elsewhere [16]. In both a 
0j Was made for a situation in which (for simplicity) fwo common factors, and two only, operated 
botani n Or more variables. For the first it was decided to work on growing tomato plants, since 
elp s believed that two factors, representing light and nutrition levels, would be found. For 
Illinois gong up this experiment, we are greatly indebted to Dr. D. Gottlieb of the University of 
" wo orticultural Department. lants in each, were grow 
е ati aini ants i s 
sg бїз, gea an te f plant, thickness of'si 


n from seed; and measurements 
tem, greenness of leaf, number of 


leaves ПАПУ mad ari ight 
“Caves ade on such variables as height О d ко знн AS : 
it wae: etc. In one sample the variance was made twice as great in light intensities, nad in the other 
of hydroponic nutrition. The factor scores, at least as ranks, 
however, when it was found after six 


as і T 1 
Were ncreased by 3 /2 in concentration ا‎ d 
The whole design. co ie е factor only—indeed there 


variance was due to on t 
Lo should be extracted. In this extreme factor com- 
be found that would give pro- 


iti tation could 5 ‘Tight i я 
sition of fond ‘would give a satisfactory * light intensity ' 


n mu known for each plant. 
Seemed а yt nine-tenths of the common 
Position ^ doubt as to whether any secon 
ion, no Orthogonal or even oblique РО: 


Dor 
facto АШУ of ; tion of any k : f 
or, profiles and indeed no rotation ; known botanically to be appreciably 
responsivenyone repeating is advised tO include more variables 
examples that would nevertheless 


8 erely physical) 
по Ontrollable sve anie (ate ent dis very of drives as factors [10] to suggest that 
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s. Е е factored from the tion of food deprivation in one sample and 
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Ber var; 1$ Work, by Haverland [16], had larg ny interesting 
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a factor 0-5 and the variance of that factor (still orthogonal) is doubled relative to other factors the 
loading will not equal 1-0. If we apply the formulae of Thomson and Ledermann [24] and Thurstone 
[25] for predicting the correlations to be expected after the variance of the selection variable has been 
changed (counting the factor as the variable on which selection is made), it will be found that not all 
the loadings of a factor will change in the same proportion when the factor variance is changed. 
However, over small ranges the simple proportionality is near enough, and Haverland's box example 
with errors [16] shows that it holds well enough to give a correct solution. Accordingly we shall 
here present the solution brought to that level, and leave for further invention the adjustment to non- 
proportionality and the other practical difficulties (e.g., obliqueness) raised by the rat problem. 


IV. THE ANALYTICAL SOLUTION FOR PROPORTIONAL 
PROFILES, WITH ORTHOGONAL AXES 


We now suppose that two experiments, A and B, with the same л variables have yielded 
two correlation matrices and two unrotated factor matrices, The only assumptions made 
are (a) that the matrices are of the same order, both containing л rows and k columns, and of 
the same rank, К, (b) that enough variables exist for satisfactory determination of the com- 
munalities [23], (c) that they occupy the same factor space, i.e., if the matrices are place! 
alongside to give order x 2k, the rank will remain k, and (d) that the variances of the true 
influence on these variables in the two situations are not unchanged: in fact, the variance 
ratios for the factors must be different from unity and from each other. 


Parenthetically, the method of proportional profiles should be distinguished, in objectives and 
methods, from the method of aligning two factors from different studies which is implicit in the fort 
lations of Barlow and Burt [2],! and from Tucker's related method [27] which has the rather differ i 
Cujective of synthesizing two factor studies to yield a least squares oblique solution to equation he 
below. The latter furnishes no restrictions such as are required to obtain a unique solution for th 
rotation; hence an additional criterion, such as simple structure Or the present method (adapted to the 
oblique case), needs to be applied to bring the two studies marching in step to a unique halting у 
A similar objective—getting maximum agreement between two studies—has been pursued by ГІВ 
and Neuhaus [30]: their method is distinguished from proportional profiles by being applied les 
the studies, except for the provision of common variables in the two studies. Proportional pro " 
on the other hand, requires the stage to be set beforehand, in order to provide a unique rotation 5. 
solution dependent on controlled experimentation. It does so in order that a particular relate 
may be sought, which is not merely one of mathematical convenience or economy, but is pos! 
to arise from the existence of an inherent structure. 


From a rotation of a given factor matrix we obtain in the (notation above): 1 
Voa\a = Vna, а 


and VopAg = Vap. 0) 


n 

1 Editorial Note. In correspondence Professor Cattell has suggested that I might indicate my ot 
views on the problems raised. The question discussed above would seem to be a special instan but 
the third case in the paper of ours that he quotes, viz., that in which the battery of tests is the same had 
the sample of persons different (p. 54). „Ме dealt with it less fully than the other two, because 让 
already been discussed in the references cited. For the instance he has in mind, our procedure Mis t 
to extract the initial * bipolar ' factor matrices by weighted summation, not by simple (i.e., not bY 
centroid method). With the notation there used, we should obtain 


Gi = FıHı = Livin and бз = FH = LaV$Hh. 


Identity of factors is defined to mean identit: of their -direction-cosi t therefore fol 
that G2 = GiD. Now let Fa = F,T. Then ^ ^" “ditection-cosines (p. 52). I 


lows 


RT = Fa = LV} = Livin DH. 


Premultiplying by Иг, we find T = HıDH; = урыу. The matrix, T, therefore, must 


itself be factorized by weighted summation апа ae e better 
factorizing TT’ = KK’, which would give nd We at once obtain, D, апаз. (This seems 


The use of weighted summation avoids the difficult: i а ly, th: : 
T 3 n y mentioned by Professor Cattell, namely, thé data 
his formula K and K 1, when independently computed, are not consistent. I may add that, even міса 


then 
than 
tors.) 


п oles 

but (as noted in my earlier paper) tends rather to be triangular : i di . Psych 
2 h ar: cf. Burt, C., Brit. J. Educ. ely 

IX, 1939, pp. 45-71 and refs. Butin general it seems desirable 1o determine hg dub factors separat 


heir similarity and rotate them accordingly.— С. 
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T E. 2 

Е he positions Vna and V; are by definition such that each factor column in one matrix 

аза ings that аге а simple multiple of those іп the corresponding column of the other, 

to the o pg ew factors arranged in the same order. The transformation from one 
, involved in the essential principle of proportional profiles, i i 

se шер : profiles, is thus s E 

nted by a k xk diagonal matrix D, such that: ا‎ 


Substituti i pari m ч 
ing from (1) and (2) into (3), we have: 
huis: Vosda = VoBàBD, (4) 
» Keeping solely to orthogonal transformations, 
Voa = Vogia дла (5) 


(si S i i 
s A71, when А is an orthogonal matrix). 
o handle the term on the right, equation (5) may more conveniently be re-written 


Voa = VonK, (6) 
where K=AsDA4. (7) 


insane” Voa and Vos are the given factor matrices, defined as to number of factors, 
necesse Ste., by the experimental process. Consequently they provide the empirical basis 
an шу for calculating К and the values derived from it. Assuming K calculated,! let us 

Ow to decompose it into the values required. We first multiply K by its transpose, 


Obtain; 

taining KK’ = Ap DX РАв = Ав D?Mp. | 
matri his expression has the typical form for the latent roots and latent vectors of a symmetrical 
latent 区 9 D? being a diagonal matrix containing the latent коо and Ав дш the 
ecto ing the latent roots and vectors in the usual way, 
w ts. Consequently, after extracting Bae D ag os determataed 


¢ сап obtair i ; і 
we ci in D, by taking the square roots of the elements In ^ 
San obtain Vag; and with n determined we can obtain Vna by (3). This suffices to 
btaining Аа also it is computationally 


determi Bs, 
Pen the position; but to solve completely by о 
O use 
i к/к = MDXAg DA = Aa DNA. 
The solutions for Ал and Ав by these procedures are unique, ur 
WO-space geometrical А OOF of the i end of the proportional profiles position [3]. 


It wi едг? 
ill be n ; "qu ible i broken that th 
А ot ü jon is not possible if our assumption 15 ОГ‹ a e 
ed that a unique solut: noe ible if our assum nor remain unchanged. 


ariances 1 ; 
n due to the true influences (actore: D are unity or have the same value, one 


or 
more) of the factors cannot be uniquely rotated. 


thus confirming our earlier 


5 if one (or more) of the diagonal values in 


V. A WORKED EXAMPLE 
trated by an artificial problem involving seven 
nstructed as (1) to be free of sampling and ex- 
he usual simple structure method, 
esponding factors (though we shall 


Working procedures will now be illus 
and three factors. It has been 50 cons! 
and (3) al error, (2) to be susceptible of solution by t 
Not assur have known proportionalities between the corr 

Sume this knowledge in achieving our solution). 


affected data has to 


，1 The be based on least squares. This involves 
Minimize Computation for fallible, sample-aflee ro ronce matrix, Vos Vonk 
i ith г 


This can be verified by 
the elements of K, soning 


Writi 

thes hese i decer Rd Mine Taking partial Ret ee ments of K. This approach was suggeste 

Py Raa go zero and solving the resultant eaa? demonstrate this айр and Tucker Ll. "Gibson 

fas зоол used fo rented шога by [ ibson 114), Lubin (171, fey lated from Vog and RA ‘only, and 

Favor Sted (pereon отлы атов а NGT Sy impli Toa na peso oa ot 
ing's iterative procedure, 1° ,, However this leaves Ue K are not Symmetrical, and 


SOnsidera i sions which sented in FoB， 
(94 S: Tation. ۴ Unfortunately. for fallible experimental 
ол, V. ОВК yields a different result from KoB = "ot рціей as ? 
ор е wo Vog’ Vog. The two K values can po comp veraging, the corresponding elements on 
ite 1. хас i ix; but can ion ; 
Posie sides of att mius Чеүїсё; using mutua seer A ] 
YA, RANT Solution for Ал follows from [7], whic 
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Relatively small differences of variance, such as might exist in practical examples, were chosen, 
such that the factor loadings in the second matrix are respectively 2/3rds, 9/8ths, and 6/5ths of those 
for corresponding variables in the first. To give a local habitation and a name to the problem, we 
can imagine that the correlation matrix represents relations between seven tests involving the manipula- 
tion of mechanical puzzles, and that success is determined by factors of mechanical aptitude (F1), 
reasoning ability (F2), and manual dexterity (F3). The two matrices may represent data from boys 
and girls respectively, Fl having less variance in the boys and F2 and 3 in the girls. Let us begin, 
as it were, from behind the scenes with the two final, rotated matrices, representing the simple 
structure and the functionally meaningful position just described. These are set out in Table I. 


TABLE I. FACTOR MATRICES 


(As known to be structured) 


A. Boys B. Girls 
Tess | Find Fma Раа | W4 | Tests | Аав Рыв Рав | в 
1 60 —40 —10 | .53 1 40 —45 —412 37 
2 一 :50 —80 -10 | :90 2 一 :33 一 :90 12 93 
3 —10 -80 -00 | -65 3 —17 -90 00 84 
4 00 50 —60 61 4 00 56 —72 83 
5 90 —10 -00 82 5 60  —4l :00 37 
6 一 :60 -00 -70 85 6 一 :40 -00 -84 87 
7 05 -00 80 64 7 03 00 92 | 92 


For uniformity we have labelled the factors the А.л” and the Fna’s. It will be observed that 
the required proportionalities exist between the loading in each pair of columns, and that each factor 
in each matrix has a position of tolerable good simple structure, having three or more variables in the 

» hyperplane for each factor (generally within --0-12 but in one instance 0-17); and each variable wit 
at least one hyperplane loading [25]. 

The correlation matrices reconstructed from these two factor matrices, by taking the inner pro- 

ducts of the rows (Илл Vna’), are shown in Table II. 


TABLE П. OBTAINED CORRELATION MATRICES 


-58 一 .37 一 .17 一 :05 — 
—43 37 -06 一 .42 一 :53 — 
一 :05 -06—-01—-48 04 .53 — 


۰29 一 :10 一 .20 一 :06 — 
—26 23 .07—60—24 万 一 
一 :10 -11—01 —:69 :02 77 


Ra (Boys) Rp (Girls) 

Tests | 1 2 3 4 5 6 7| Tests 1 2 3 4 5 67 
1 一 e 

2 01 — . ыя 

3 —:38—.59 — P Ww = 

1 ف‎ 一 :17 一 :59 .50 — 

6 

Ж 


MOV Б шо юн 


А " Р " 1 
Taking these correlation matrices as our starting point, 1 А aningfU 
i inci ,let us t e at a meanine je 
rotated MD m * ру ae a E of proportional profiles ad à by rotation for sime 
Structure. centroid factorization of the correlation matrices yield: E rotate 
matrices (Table III). yields the following un: 
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TABLE III. UNROTATED FACTOR MATRICES 


VoA (Boys) Von (Girls) 
Tests | Fos, Fods: Fods n Tests | Ров FoBs FoBs т 
1 34 — —66 01 55 1 31 51 18 39 
2 59 29 一 68 89 2 71 32 ==58 94 
3 |-6 25 45 66 3 |-6  -—60 14 86 
4 |-75 -19 —07 60 и 21 05 82 
5 22 一 77 45 84 5 15 29 51 37 
8 34 86 11 87 6 4 ж —19 88 
7 50 37 51 65 7 6 —70 2 94 


Let us first solve from this point by the proportional profiles method From (6) we know 
Иол = VonK, 


whence K= VobVoA. (8) 
Our next step i t K by equation (8)， However, since FoB is not a square matrix, it cannot 
have an inverse’ te Hen jê possible which we may call the symmetrizing method. This has 
ail advantage that it is certain to yield D; whereas the trimming method (see footnote 2) sometimes 
с А By this approach we aim first at a symmetric matrix S, such that 5 = V;Vop. In this cal- 
s ation the whole Vip and Vog is used, yielding, in this case, a gx 3 matrix. Note that, since 
"a ola), and K = Иод а = Vod Van) Von, therefore K = S, Mood. | am 
ошаш? use of the square non-singular matrix 5 thus permits us to solve? for К; and we then 
= [0342 0396 . .8638 — -0070 0162 . [465 ,0615 -0993 
= E: 9396 225] x-[.o4 — 293 —391| KK (083 24 25418 
12225 .0379 1.4743 1035 一 1.0279 1-0563 
on the electronic computor by the method 


The latent ro 
ots and vectors were È 
ruman Kelley, adapted and reporte: Neuhaus [30]: 
0714 +8048 E 


2:33 я 

D2 1:53 [Г Ё - 

R { 一 | 2895 5487 — 7843 
| | в -| 97 a! 和 一 | 9545 —2266 8 


obtained 


“т d by Wrigley and 


۰76 
70 | | 
imag! Equation (8) and those at the end of the paragraph gannot validly be widen area dierent approach 
it пагу Concept of the pue of a rectangular matrix. Since we eventually s ?Sevice, is retained. It ig 
Tetained азкеа why this initial luton «ih its inadequate matrix tunas to follow. For, although a 
тес ned because it A logicall Paos direct and easier for non-mathematicians lO ipg А PA i ше тыд 
Sin angular VR cannot be sed a solution could evidently be Ded ps РЁ isast this would be 
So Tansformati FS k variables (rows) is appel no such sample of rows could 
јап арр о rimental and sampling етот monis the least squares fit for 


Бе! non-fallib ith expe 
Tul le data, though with exp here the only satisfacton Som this squaring of the rectangle 


the "presentati trix. y satis 
DER hole matrix fons се отеп with exact data ат тїп it has seemed best to carry through 
both Imming * footnote 1). But for maximum illumination * S secarison, though the former 
Will pihe crude? seul be seen below eater х symmetrizing procedures for comp: 
۷ упт out only шо "since it is not e i Po three rows each and then take the in- 
erse of irect * trimming ’ method we first trim oB 
the first (see (6)) as Follows: РР 21532 
—13:49 7746 54] 
Whi Vor -[ I _138 —11-70. 
Ich d i s: 
> Post-multiplied by the square matrix of VoA (trimmed), become: p 
2-657 ee 
1908 一 3.248 E 
К = [ :328 -960 " | 
ERO he value for K found from the * trimmed” matrices. 


2 
As wi 
S Will be seen, this result differs decidedly from t #0 


Factor Rotation for Proportional Rotation 


The values for Vna and Vig are given by eqns. (2) and (3); and the results are shown in 
Table IV. Ал can be reached, as indicated above, either by calculating K’K and proceeding 
therefrom as from KK’ (which will also give a check on D?) or by eqn. (1), p. 86. 


TABLE IV. PROPORTIONAL PROFILES SOLUTION 
(Symmetrizing Approach) 


Vna = VnBD (Boys) VnB = Vopg (Girls) 
Tests Еді FnAs FnAs Tests FnBı FnB2 FnBs 
1 53 —:42 一 "15 1 34 —-48 — 18 
2 一 "63 一 "77 -05 2 —:41 —:88 06 
3 —:14 -80 -08 3 —-09 :92 09 
4 -08 ۰52 —:56 4 ۰05 -60 —:67 
5 :89 一 '14 一 '03 5 58 —:16 —-04 
6 —+57 01 272 6 一 '37 *01 86 
T +08 ۰07 -80 T 05 ۰07 -96 


Except for errors from rounding, these values are the same as in Table I. Even in Ё? 
and F2, where the approximation is rougher, one has no difficulty in recognizing the similar 
patterns. A further example since worked by Haverland with improved mechanics O 
computing furnished the original values correct to four decimal places [16]. Furthermore 
the values for D—1-53, 0-87, and 0-84—are close enough for all practical purposes ; to the 


TABLE V. PROPORTIONAL PROFILES SOLUTION 
(* Trimming Approach ’) 


Vna (Boys) Ипв (Girls) 
Tests Fray Fndz Fndz Tests Еву FnBs FnBa 
1 :58 —46 01 1 32 —52 —12 
2 —58 —74 —-01 2 一 :46 —-85 12 
3 一 :00 81 -00 3 一 '05 :92 :00 
4 16 48 一 :59 4 :10 +53 3 
5 -88 —45 :20 5 -57 一 :20 :00 
6 一 .73 -06 -57 6 一 :40 09 85 
7 一 :12 :01 -80 7 01 02 :97 


р А 
The agreement is almost as good as by the other approach; but the D matrix failed, and the Ve ! 


matrix had to be calculated through K’K. The D matrix gave, not the known ratios, but 0:89, 4°79 and les 
presumably taking only three rows of a matrix is likely i reduce its rank, since four of the seven varia ye 
must be linear combinations of the others; and we are unlikely to hit by chance on those that woul Е need 
therank. Accordingly, since *symmetrizing gives more immed. ids the 
to find a pseudo-inverse, it seems the better procedure. 


w ided 
original, namely, 1-50, 0-89, and 0-83; and the closeness of these ratios to unity has provide 
a severer test of the method than we hope will normally be demanded. imple 
Now let us compare with the above proportional profiles determinations the E to 
structure position as reached by the usual trial and error search. This is done primati!” 


M : an 
1 It is interesting to compare this result with that fc * tri , ix К. The VnA 
Vap values so found are: ound by the ‘ trimmed’ matrix К. 
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iately what is required and avo! 


^ 一 一 - 


P 
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ү иеше ие computational times required and the role of ambiguity and approxima- 

Table IH p wo methods. А blind rotation for simple structure was carried out from 

iria y an experienced worker. Rotations proceeded steadily to a satisfactory 
ure, obtained after six over-all rotations for Vna, and five for Vng (Table VI). 


TABLE VI. SIMPLE STRUCTURE 


Vna (Boys) Vana (Girls) 
Tests Fray Fn: Fnas Tests FnBı FnB: FnBs 
1 65 —35 —06 1 40  —47 —-08 
2 一 :14 80 —04 2 —36 —90 07 
3 一 :45 —$83 11 3 一 :16 -90 00 
4 00 .45 —63 4 11 546—711 
5 92 -01 08 5 59 —4l 06 
6 一 .65 —-0l 67 6 一 :50 02 79 
7 —01 06 81 7 —09 00 96 


mighe nd rotation has thus succeeded in finding the original simple structure, though others 
е si nd an alternative structure. In their agreement with the true position (Table I) 
Te р! structure (Table V) and the proportional profiles methods (Tables IV and V) 
рон? 81У good. But in the wider applicability and in the analytical solution the pro- 
Case j nal profiles solution can claim superiority, at least when restriction to the orthogonal 


© 15 overcome. 


а 


VI. SUMMARY 
uw d. ТА "E factors correspond to organic 
ing е assumption is made that, when (and only when) factors соп і 
f aces inherent in the data, will their loadings alter hole by ratios peculiar to each, 
one situation to another in which hole is differently involved. 
. It has been shown іп the orthogon tion iE соге 
Шу by one unique paired rotation а оо nd tha position 
Overed with certainty by an analytic 80 ution. | d 
n example has been worked on errorless data, in which the solution was made to be 
cously the simple structure and the proportional profiles position. Re-working 
meth € correlation matrix, this posit ly found, without ambiguity, by two 
Ods. The former required about seven hours of compo le ; the latter 
Ours of tape- ing and ten minutes О electronic Z. А 
Ork in dem eem that the analytic solution can be obtained with great 


accura tiri 

tacy fo ; Accuracy, fora box problem with intro- 

n d or RI acceptable A 16], in which control was 
б 


е ychological experiment oor agreement 

p mpted of the factors believed to account for variar ЫШ a E pex of 
d the pre thod awaits the following 
. additional restrictions 
(ii) Allowance for the fact that, 


as impblnecessary to obtain a unique solut i 

Simp, ed in the de terr and Ledermann [24], variance а 

tune ion of ional change in the ord ory M individual loadings for the fact 

tt e individ iable loading. i i 

Variance experimenter OR be compelled to produce his experi 
© by selecting on a variable highly loaded in the factor, 
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. Cattell, R. B. (1955). ‘The principal replicated factors discovered in objective persona 


. Cattell, R. B. and Gruen, W. (1955). * Measurement of the primary personalit 


. Thomson G. H., and Ledermann, W. (1939). * The influence of multivariate selecti 
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MECHANISMS INVOLVED IN GROUP 
PRESSURES ON DEVIATE-MEMBERS 


By HERBERT A. SIMON and HAROLD GUETZKOW 
Carnegie Institute of Technology 


I. Pr oblem 

п. IL. Variables and Equations of the Deviate-Member Model. II. C i 
o TEM í : . Compariso. 
f Deviate- Member and Aggregate Models. IV. Cohesiveness and Rejection. V. Suman. 


I. PROBLEM 
d and tested a number of propositions about 


Leon Festinger and his associates have state 
The purpose of this paper is to carry a step 


с Бах 
dapes dion processes in small groups [1]. ie ) 1 
We shall = synthesis of a segment of these propositions into an interrelated system. First, 

ques ranslate some of Festinger's propositions into а more rigorous form, and examine 
exami €spondence between our equations and his verbal statements. Secondly, we shall 

ne an empirical study by Schachter [2] in its bearing upon our model. 

Which or many purposes it is convenient to describe a group as though it were an aggregate 
may be characterized by merely averaging or summing the individual characteristics 


Of all į ; ithi 
1 its members. But as we attempt to explore the internal processes within groups, we 
the leader) from the others.! 


ün 
ч wee to differentiate one member of the group (e.8., 
De; th the propositions Festinger has formulated for group pressures belong to the first 
average ey are concerned entirely with aspects of group behaviour that can be treated as 
sti Or aggregates for the group as a whole. Elsewhere [3] we have examined this part 
Of the inger's theoretical scheme. The present paper will undertake à parallel analysis 
E Temainder of Festinger's system—those propositions that involve specific reference 
mem articular member, A, of the group. The member selected for attention 15 а deviate- 
toward: and the principal variables in the system represent the attitudes and responses 
the the deviate-member of the remaining members. In a later section we shall comment 
Constr relation between the two parts of Festinger’s system, and between the models we have 
Ucted to represent them. x 
“stinger states five hypotheses about communication resulting from pressures toward 
tot aus that describe the behaviour of deviate-members. _ Of these two contain references 
memb communication among individuals in a group; one is concerned — er potential 
throu nb groups; the remainder refer to changes in composition of t group, ов. 
to relat Pushing members out of it. Our procedure will be to set forth our system, an en 
€ the model to Festinger's verbal hypotheses. 
QUA TIONS OF THE 


П. VARIABLES AND E UODEL 
VIATE-MEMBER 

эт [ Festinger—those numbered 2a, 2b, 2c, 4a, and 

50 i Consider a group of n members 


The five deviate-member hypothese: 
ver the group excluding A. 


~are stated j А ; iti e. 
and а 2518164 in terms of eight variables ns 有 9 


Particul : 
к ar member, А. The variab d а А 
Р, Ts The relevance to the group of uniformity in respect of the topic under discussion. M 
> The average P; n members of the group (excluding А) to communicate fo 4 а 

ez. ressure UPO 


tim 
ME 
in his The i } 
just Sa mport "—" been recognized © 
rain thet of Eu а ашаа We speak p am index of Who gether 1f either of these conditions 
Я ; Bener; Ө if all prices mo 1 i 
5 index be Movement of DEE Tis iran less. Thus, it is a pragmatic question as to пап ы of 
aggregate ao correspondingly must formulate à more complete mode! 1 d 
B model, or when Wi 
e: , д 
5 involves less aggregation- я 
8 


i economist in applying index. numbers 
ized by the ices ’ only if we are interested 


Mechanisms Involved in Group Pressures оп Deviate- Members 
Раб): The Discrepancy between the opinion of A and the average opinion of the rest of the group, 
as perceived by the rest at time t. 


Ca(t): The cohesiveness of the group with A—that is, the average intensity of the desire to retain А 
as a member of the group at time г. 


LA(t): The perceived receptivity (‘listening ") of A to influence—that is, the average estimate by the 
other members of his willingness to be influenced by them at time r. 


Ua(t): The average pressure toward uniformity of the group with A, as felt by the group at time t. 


C(t): The cohesiveness of the group, exclusive of the deviate member—that is, the average intensity 
of the desire of the other members to remain in the group. 


We now hypothesize the following system of equations: 


Palt) = Pa[D4(D, Оа(0)]; (see footnote (?)). (1.1 
40 _ раф), LAD. 02) 
AD L рш, Ual), Саб, LADI (1.3) 

Ua(t) = Ua[Ca(?), R, C]. (un 


Since Festinger has defined cohesiveness as ‘the resultant of all the forces acting On 
members to remain in the group ’, the level of C(t) at which members are indifferent as to the 
choice between remaining in or leaving the group could be taken as a natural zero for this 
cohesiveness variable. Analogously, Ca(r) may be defined as the average ner attitude of the 
group (other than A) towards A's remaining in the group. Then positive C4 would mean а 
net preference for his remaining, negative C4 would mean rejection—a net preference for 
his leaving. 

The four equations constituting our model postulate that the changes in РА and UA 
are instantaneous, while the changes in L4 and C4 take place gradually through time; as 
functions of the variables appearing on the right-hand sides of their respective equations. d 
discussion of the nature of such systems of algebraic and differential equations is presente 
in the appendix. iate 

We are now ready to examine Festinger's five hypotheses concerned with the devia 
member: 

Hypothesis 2a: The pressure to communicate about * item x ’ to a particular member of the 


group will increase as the discrepancy in opinion between that member and the communicato 


increases. 


This hypothesis states a relationship between D4 and Pa, as defined in our equation (1) 


DOP4 
and asserts that КУУ > 0. 


Hypothesis 2b: The force to communicate about * item x’ to a particular person wil 
to the extent that he is perceived as not a member of the group or to the extent that 
wanted as a member of the group. 


1 decrease 
he is 0 


This hypothesis states a relationship between P4 and Ca, via Ua (eqns. (1.1) and (140) 


and asserts that АРА >0 204 > 0. 


QUA. ° СА he ` 
" А . РРБ 5 H " i e t 
Hypothesis 2c: the force to communicate ‘item x’ to a particular member will incredesired 


шою e. perceived that the communication will change that member's opinion in the LJ 
Ire . * 


Т np, dd 
The present system of equations does not translate the full content of this proposition» A 
particular that part of it which refers to the ‘desired direction’ of change in opinion ysis. 
closely related proposition stated by Schachter [2] plays an important role in his ара 55" 


а ; Я 10) 
Schachter makes the assumption *that perceived difference increases with discuss" 


А tion 
1 Read: The magnitude of the pressure u i ime t is a fune 
5 D мє, роп group members to communicate to A at time / T. 
of the discrepancy of 4’s opinion from the rest of the ou. and of the 人 toward uniformity: 
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qom the actual discrepancy remains constant D, p.201]. In the present system of equations, 
s mechanism that is postulated to produce this kind of behaviour is slightly different. We 
т шп that, as discussion proceeds, the perceived receptivity of 4 to influence (L4) declines 
n € perceived discrepancy of opinion persists. This is stated in equation (1.2) with 


d 
«0, —— <0. Further, we assume that the lower is L4, the lower is the equilibrium 


òDa 1А 
Value of Са. This is expressed by equation (3.3), with 384 < 0, 84 >0. 
0C4 OLA 


By means of these additional assumptions, a high persistent value of D4 depresses L4 
(an. 1.2); this reduces C4 (eqn. 1.3), with a consequent reduction in Ua (eqn. 1.4), and 
simultaneously in Pa (eqn. 1.1). Thus, if the perceived receptivity of A to influence declines, 

0 also will the pressure to communicate to him. 

Hypothesis 4a: The tendency to change the composition of the psychological group (pushing 

members out of the group) increases as the perceived discrepancy in opinion increases. 


This hypothesis relates C4 to Da (eqn. 1.3). It asserts that x <0. 
Hypothesis 4b: When non-conformity exists, the tendency to change the composition of the 
Psychological group increases as the cohesiveness of the group increases and as the relevance 
Of the issue to the group increases. 
А р) QU, QU, 
This relates C4 to R and C via Ua and asserts that 84 <0; 4>0; 54 0 (eqns. (1.3) 
QUA oR DC 


and: (1.4). 
ab Our system now consists of the equations (1.1) to (1.4) together with certain assumptions 
Out the signs of partial derivatives of the functions that appear in them. For convenience, 


We collect the latter assumptions. 


Tb (1.1 a) Puso (115 
P<0 1.2< <0 (12b) 
ya зи (1.3 ©) g,^0 (1.3 d) 


0 (1.3 a) gy<0 (3b) gc«0 
c>0 (14а) Up >0 (1.40) veso (1.40) 


we have suppressed the subscript A, and 
designating the two variables without s 


9r notational simplici employed subscripts 
al simplicity, М 

у prote partial differentiation, ubscripts, R and C, 

and C' respectively. i 

Deviation and Rejection: The Schachter Experiment. A group of persons, including several 

Confederat AE Жесе исар together for discussion. The variable Da, where A 

es or * paid participants ’, is brought together for dis nged) behaviour of the con- 


15 one of * deviate °), i ined by the (pre-arra 

fedete (Пе confederates (the * deviate’), is determined by the (pr induced by the i 
àte, an ai Different initial values of R and C are induced by the in- 
tte, and hence remains constant. ‘The cohesiveness of the groups was checked at the 


Structions gi i 
Biven at the experiment. $ 5 
end of the сопе а perd found constent (see (2) Table П, p. 194); R is assumed to remain 
Onstant for the f. i f the discussion. Direct measurements of C and C4 were also 
made е forty-five minutes of tu nts were made of Pa(t). These were 
Мегара the end of the trial. During ds course, measureme 
€ valu i intervals. i й " 
( hg Schachter experimen t orovides us wi t i ped oe pottea шо Ue гарасы: 
Predi е can test whether differences in | 
Gy d differences in the terminal values of C4, istent with those predicted by the model. 
Wi ths of PA(/) are consistent ^^ as parameters, we obtain the 


s e cant time ра e 
Eliminating [ey GD and regarding DA, R, an ш) 
e 8 . 
Quation, . dLA = 4(DA, LA). 
dt (2.2) 


or = гира, Ua (Car R, С), C4: Lal 
d i iables from assump- 
TAE H e paths of the varia D 
t Sch 76 À nces about the tim d aro algebr 
raths about the iin his paper, attempts O derive infor eause the beso DUE i Chat perestved 
У E Of certain pere ly by introduci ecisely what assumptions underlie the 


diffesr than dj 5 1 is on 
Terence? differential equations, he can do MIS Cig cuit to know precisa done following Hypothesis 2c, 
* . 


$ lncrea i ii i A - 
ne p, creases wi time 1 
Wis post displayed in his Fig, 3. By mathematizing цад 
le to know exactly what assumptions are 
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Mechanisms Involved in Group Pressures оп Deviate- Members 


Consider now the direction field of the system in the (Ca, L4) plane, with the parameters held 
constant.| Since C4 does not appear in (3.5) we have: 


4] = (2.3) 
alco ® 
227 EL 24 
= >0. (2.4) 
BLA в = gyUc + £c 
There are grounds for a plausible assumption which enables us to say that these curves have à 
particular shape—namely, that ora (eqn. 2.4) approaches zero for very large and very small values of 


Ca. Then the direction field will have the general form depicted in Fig. 1. The argument for this 
is as follows: Equation (2.2) describes the change in the cohesiveness of the group with A. It seems 
reasonable that this mechanism is subject to saturation, i.e., that when C4 reaches very high levels, 
it will not be driven much higher by further decreases in D4. Likewise, when Ca reaches very low 
levels, it will not be driven much lower by further increases in D4. Typical paths for the system 
are shown from initial positions A, B, C, and D. Thereisa single stable equilibrium for the system at E. 

Now, referring to equation (2.2), and remembering our assumptions as to signs, we find that an 
increase in any one or more of the parameters D 4, R, and C corresponds toa shift to the left of £4 = 0; 
while a decrease produces a shift to the right of g4 = 0. Similarly, from (2.1) an increase in А 
means a downward shift of ¢ = 0 and a decrease in D4 an upward shift. It follows that an increase 
in one or more of the parameters shifts E to the left (and downwards if D4 increases); while a decrease 
in the parameters shifts E to the right (and upward). Fig. 2 shows the equilibrium curves for low 
(solid line) and * high ' (broken line) values of the parameters, respectively. 


FiG. 1.—Possible Paths of the Systems Fic. 2.— Positions of Equilibrium 
Arrows show possible paths (i.c., simultaneous E and E' are equilibrium positions of the system 
changes in L 4 and C 4) for various initial points, corresponding to different values of the para- 
A,B,CandD. The system will be in equilibrium meters appearing in equations (2.1) and (2.2). 


when it reaches E; for then, ¢ = 0 (see equation 
2.1) and a= 0 (equation 2.2); hence, L4 and 
C 4 will remain constant. 


For the particular values selected here, it may be seen that the equilibrium value of CA is posit 
with low parameter values, but negative with high: that is, a confederate will be rejected by the E ge) 
[CA(T) — 0] if he persists in a very deviant opinion (D4 large), if the group is highly cohesive (C ON 
and if the item under discussion is highly relevant to the group's goal (R large). Under the Em of 
conditions, the confederate will not be rejected. These conclusions are all consistent with the d 
Schachter's experiment [2, pp. 198 and 199, Tables III and VI].2 iables 

It will simplify matters if we regard the system as approximately linear, and measure the Varia 4 
as deviations from the equilibrium values. That is, we treat the partial derivatives as constant, 
take zero as the equilibrium value for all the variables. The system then becomes: 


РА = PpDA-4-PuUA. 29 
ta = $pDA+¢LLa. ёа 
dCA (2.7) 
ш 一 8DD4 十 8UU4 十 SCC4 十 BF 三 4 
UA = UcCa+URR+UCC. G 
1 For the methods employed here, see Lester R. Ford, Differential Equations, pp. 9-12» and A A 


Andronow and C. E. Chaikin, Theory of Oscillations, pp. 8-12, 182-193, and 203-208. oint 
2 It should be noted that the measures of C4 in Sc achter’s experiment did not determine the zero Pthat 


(i.e., the line between acceptance of A and rejection). The data show that the direction of shift © 
predicted b; the model. 
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Four possible paths of the system, originating at points a, b, c, d, respectively, are shown in 
3. Because we have taken the equilibrium point of the system as origin, a shift to the right 
sely equivalent to a shift to the left of g4 = 0 in the previous 
regarded as three possible initial points, with 
arger values of C, and/or R: (these larger 
fixed origin, or the initial point to the right 


Fig. 
of Hie initial point of the system is preci 
ub esentation of the model. Hence a, b, and c may be 
on same initial values of La, but with progressively 1 
fel ues, as we have seen, shift g4 to the left relative to à 
ative to an origin at £). 
From the shapes of the paths originating at а, с, and b, we conclude: . 3 
T Case (1): If C and R are low enough, Ca will increase during the trial from its initial value 
wards the equilibrium value (Path a); M x " npe 
ta Case (2): If C and R are high enough, Ca will increase during the trial from its initial value 
ward the equilibrium value (Path c) ; 
ge of values of C and 


Case (3): For some intermediate гап 
decline during the trial (Path 5). Ы . > 

at the changes in P4 will be proportional 
C, the changes in UA will be proportional 


R, Ca will first increase and then 


io anew for a constant value of DA, we sce from (2.5) th 
Юю ne changes in Ua, and from (2.8) that for constant R and | ) 
E e changes in C4. Hence the changes in PA will be proportional to the changes in Ca. We 
nclude that the time shape of the curve Pa(t) will be the same as that shown in Fig. 3 for Cali). 

Whi Itis in this way that we would interpret the data presented in Schachter's Table VII [2, p. 203] 
of EH measures P4 by the mean number of communications addressed to the deviate ' during each 
Schoor ten-minute periods in the course of the experiment, For higher values of R and C in the 
ichachter experiment, Ра) fell in Case 3, above. When С or R were at lower values, Pa(t) had a 
ume path corresponding to Case 1. Case 2 was not observed, suggesting that despite Schachter's 
audable attempt to “ reproduce the variables and phenomena under study with greater intensity In à 
Purportedly * real-life situation ’ " [2, p. 195], he fad not succeeded in reaching high enough levels 
d and R to produce the second case. However, he was successful in producing interesting qualita- 
and differences among the time paths of PAC) by using his four experimental combinations of high 

low levels of C and К. 


ii th. 
Fic. 3.—Shape of Time Pa of the time 


dence of the shape 


en 
trate depemvalues of C and К. 


h upon the val 


DEVIATE-MEMBER AND 


AGGREGATE MODELS 


3], we have developed a model 
h aggregate relations, 


The arrows illus 
pati 


III. COMPARISON OF 


encompassing five 


As mentioned in an earlier рарег [ i.e., are concerned 


wit ostio i ich deal wit 1 
me ee ee ae AN del consists of the following 
edunt, members of a group taken as à whole. The mo 
aD _ уур, LO, РО). вл) 
7 (3.2) 
P(t) = PID, UO) = 
140 = ЦОО. (3.3) 
4€ .. рф, UO: CO (34) 
dt 
u(t) = UIC, К}, а (3.5) 
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where the variables are defined as follows: 


D(t): The perceived discrepancy of opinion on an issue among members of a group at time /; 

P(t): Pressure upon members of the group to communicate with each other at time t 

L(t): Receptivity (* Listening ") of members of the group to influence by communications from 
other members at time /; 

C(t): Cohesiveness (identical with the corresponding variable in the deviate-member model). 

U(t): Pressure felt by the group to achieve uniformity of opinion, i.e. to reduce perceived dis- 
crepancy of opinion at time г; 

R: Relevance (identical with the corresponding parameter in the deviate-member model). 


Now let us contrast this aggregate model with the deviate model. The variables in the 
two models are quite analogous; but the equations are somewhat different. 

It will be noticed that the mechanism postulated in equation (1.1) is the same as in (3.2), 
but in the former case aggregated only over the subgroup excluding 4. The mechanism in 
(1.3) is similar to that postulated in (3.4), but has been elaborated slightly to incorporate 
the effect of L4—perceived receptivity of A to influence—upon Ca—the cohesiveness of the 
group with A. This mechanism would not produce essential differences in the behaviour 
of the system unless, as in the Schachter situation, there were forces tending toward subgroup 
formation. 

The mechanism that makes the presence of L4 in (1.3) important is equation (1.2). 
This states that, when there is a discrepancy between 4° opinion and that of other members, 


c 6 


>v 


E 


Fic. 4a.—Deviate-Member Model Fic. 4b—Aggregate Model. 
DIAGRAMS OF CAUSAL RELATIONS 


Arrows are drawn to each dependent variable from each independent variable that influences it 
directly, as postulated in the equations of the system. 


and as this discrepancy persists, the group will begin to perceive that A is unreceptive. | In^ 
perception in turn, through equation (1.3), reduces the cohesiveness of the group with n 
Equation (1.2) was not of importance in the aggregate model because there was no reas 
to postulate that the members would be highly unreceptive. B 
,,, Finally, equation (1.4) represents a generalization of (3.5) to take account of the poss 
bility of subgroup formation. In the deviate model, pressure toward uniformity Wit 55 
depends both on the internal cohesiveness of the rest of the group (C) and on their cohesivene 
with А (СА), a distinction we did not have to make in the completely aggregative model. 


By way of summary we give a diagram in Fig. 4a and 4b of the causal relations among | 
ables implied by the two systems of equations. For simplicity we assume change in уап : 
take place at discrete intervals rather than continuously. Thus, LA(t4- 1) is the value of LA 0 
interval after t—similarly for Са, D, and C. eral 
,. The deviate-member and aggregate models may both be regarded as special cases of a more gor the 
implicit model in which aggregates do not appear, but only variables for individual members ОЁ (2 
group. One special case (corresponding to our aggregate model) arises when the group behaves rises 
“undifferentiated ° manner, with similar mechanisms operating for all members. Another m5 
when the behaviour of one group member (ora small minority) becomes differentiated from the ot they 
while the behaviour of those others can still be aggregated—our deviate-member model. When deal 
are justified empirically it is advantageous to make these simplifying assumptions rather than ,dies 
with the more cumbersome general model. Similarly, in treating certain of the empirical Een 
with our aggregate model, we have ignored particular equations when the mechanisms they represe we 
were not operating in the empirical data being examined. In formulating the deviate-mode ate 
added implicitly a mechanism of primary importance that needed to be only implicit in the артел ос, 
model. This does not mean that we are explaining the several studies in terms of different thee wd Y 
but rather that we can conceptualize a general theory wherein the particular phenomena under 5 
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Tepresent special cases. For th: i i 

3 3 e present it seems unprofitable to writ i 

S ; i T T ite the more genera 

ио consist of composite sets of equations analogous to those used їп Hs io гона 
group member. They would bé complicated and difficult to handle, and at first glance do 


Not seem capable of yielding fruitful derivations. 


IV. COHESIVENESS AND REJECTION 


ihat tera concluding, we should like to comment on the cohesiveness variables, C4 and C, 
cal in the two models. Cohesiveness is several times defined by Festinger and his 
It eues as ‘ the resultant of all the forces acting on the members to remain in the group ’. 
ise = the role of an intervening variable between (a) the various forces that increase or 
Те паа attractiveness of the group, and (b) the behaviours that result from these forces. 
оеш nes as an intervening variable depends on whether, in fact, all the various types of 
RUM and negative attraction to the group do produce the same kind of behaviour provided 
y that their net intensity is the same. The evidence on this is far from conclusive. 

ec C is defined as the * resultant of all forces to remain in the group ’, the influence 
٤ А ractiveness of other groups is ignored. By interpreting C as the * net attractiveness ’, 
model seem that a wider array of phenomena could be encompassed in the aggregative 

el. Only one of Festinger's hypotheses considers the operation of the attractiveness 


9f other groups: 
Hypothesis 3c: The amount of change in opinion resulting from receiving a communication 
concerning * item x ' will decrease with increase in the degree to which the opinions and attitudes 
involved are anchored in other group memberships or serve important need satisfying functions 
- for the person. 
n a ‘net attractiveness ° interpretation of C(s), this hypothesis states a relation between 
and C via L and U, and is in fact identical with hypothesis 35: 


Hypothesis 3b: The amount of change in opinion resulting from receiving a communication will 
ч increase as the strength of the resultant force to remain in the group increases for the recipient. 
Add is, if C is the net attractiveness of the group, it can be increased either by increasing the 
the is to remain in the group or by decreasing the attractiveness of other groups. Hence 
© aap in 3b: ‘strength of the resultant force to remain in the group ° is the negative 
aa analogous variable in 3c: ° degree to which the opinions and attitudes involved are 

Огей in other group memberships’. 


In this context, it may be well to call the reader's 


nes; i з T 
5 variables in the deviate-member model, wherein we regar 


1 ini " * 
5 again is an effort to define the variables in the system m 
a indicates that no new 


е ^ 
is A model with Schachter's dat 
Ssary to account for Schachter's results. E 
forces of other groups from the attracting forces of 


thi one wanted to distinguish the attracting S x 
the group upon which attention is centred, two cohesiveness variables might be used, as Cı and C2; 
Net attractiveness’ then being Ci— C2 = С, ог cohesiveness. , ; 

i i i s ambiguous. The consequences 


attention to our interpretation of the cohesive- 
ded negative values of CA as ° rejection ”. 
ore parsimoniously. The agreement of 
“rejection * variable, apart from CA 


Temainin 

ng in the whole group. The ? ы 

(ay UD Situation; (1) The Боан may consider the alternative of withdrawing from the group, 

B sitive to its attempts to control his opinions and behaviour, 
е or more members who cause 


Percept 

ion of the distributi 

u e dis: roup opinio: Oss! 

ро; tribution of group n membership in other groups al d е 

veness, SH models all inv implicit assumptions with respec: lese 
M da to be interpretable as the average inten 


erception 
tions. 

The variable C would RIE ejection of one or à few members. 

i mberofthegroup. Noattempt 


tbe best i i 
interpreted as the average des P do wish, however, to call attention to theirimportance 


“vas ma 
Int € to carry out these refinements. ч 
һе further development of models of this kind. [ , ; 
The utility of mathematics in dealing with phenomena 


Cor ; 
9 тте, j 1 
the king К on Mathematization. by no means universally accepted. What advantages 
9 We think ieu ет undertaking? We have tried, in examining Festinger's 
ni ys 
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propositions, to illustrate concretely some of the advantages derivable from the construction 
of a formal mathematical model: 

(a) Knowing more precisely what mechanisms or structural relations are being postu- 
lated, and sometimes calling attention to the need for further clarification of the operational 
meaning of definitions and statements; 

(b) Discovering whether certain postulates can be derived from others, and hence can be 
eliminated as independent assumptions; whether additional postulates need to be added to 
make the system complete and the deductions rigorous; and whether there are inconsistencies 
among the postulates; 

(c) Assisting in the discovery of inconsistencies between the empirical data and the 
theories used to explain them; 

(d) Laying the basis for the further elaboration of theory, and to deductions from the 
postulates that suggest further empirical studies for verification; 

(e) Aiding in handling complicated, simultaneous interrelations among a relatively 
large number of variables, with some reduction of the obscuring circumlocutions entailed 
by non-mathematical language. : 

In the long run, mathematics will be used in the social sciences to the extent that It 
provides a sufficiently powerful language of analysis and exposition to justify the time an 
effort required to use it. Social phenomena have proved sufficiently baffling; whether our 
inclinations are mathematical or literary, we cannot afford to prejudice what is the ‘one 
best ` methodology. Let methods, like substantive theories, prove by actual trial which is the 
more powerful, and under what conditions. 


V. SUMMARY 


1. In this paper we have examined certain hypotheses concerned with pressures toward 
uniformity upon the deviate-members of groups, and have endeavoured to develop them 
into an integrated, coherent system. 

2. We have devized a model containing feed-back mechanisms to provide linkage among 
such variables as the perceived receptivity of the deviate-member and the desire of the group 
to retain him in the group. | 

3. Derivations from the model made it possible to check the system's congruence with 
empirical findings in an experimental situation. The check confirmed certain assumptions 
incorporated in the model, but showed also that many of the hypotheses set forth by Festinger 
and his associates simply were not being tested by the experiments. The mathematica 
derivations made it possible to predict time paths for certain of the variables and to chee 


these predictions against the data. 
4. A comparison was made of the deviate-member model with a homologou 


developed for groups without a deviate. The two were found to be special cases of a m 
general system. bé 

5. Two seemingly independent variables, cohesiveness and rejection, were found to d 
interpretable as merely opposite ends of a continuum. The new interpretation yieldee 
der;vations which were consistent with empirical data, and the consolidation helped increas 
the parsimony of the model. 


s model 
ore 


APPENDIX. SYSTEMS OF DIFFERENTIAL EQUATIONS 


р ; а f 
The models with which we have been dealing attempt to explain a set of phenomena in term a 
equi 


four or five mechanisms, each mechanism being represented by a differential or algebraic ТОГУ, 
Implicit in this method is the assumption that systems can actually be observed, in field or laboratc 
whose behaviour is determined by these mechanisms and which are isolated from other systems 
sufficient degree of approximation. _ 
Consider a system of z differential equations in n variables: 
dxi 

т = Л» Xn) (i — 1,42 

А : n zd 1). 
Arrange these equations in order of the * rapidity ° of the adjustment processes: (cf. below, Р. 10 
Let us suppose that we observe the whole system at intervals of т minutes (or milliseconds, O" ero 
and over a total span of T minutes (or milliseconds, or years). Now if the system starts at time 


(0 
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from a di ‘libri ui i ; 
will ا‎ position, certain of the variables (the first K, say, which adjust most rapidly) 
ively reach their equilibrium values (within, say, +1 per cent.) within 7 minutes, and hence 


bef 1 
ore the next observation of the system. For these we shall have at all times (approximately): 
ЛО, Xn) = 0 G=1 0) 


Which gi i 

mieh bes & algebraic equations to determine the first k variables in terms of the remaining (п — А). 

values at ДВ the remaining (n — К) variables, some may have such slow adjustment processes that their 

заш ime T are effectively the same (within, say, = 1 per cent.) as at time 0. Suppose that these 
variables numbered from (m-+1) to т. Then the corresponding equations can be eliminated 


f 1 
ону), End the variables treated as constant parameters: à . 
(19, тога Whi left, then, in addition to equations (2), a set of (т— К) differential equations of the form 
priate sel which we can determine the time paths of the variables Xj cles Ате We see that by appro- 
Select ОЕР of the time units, 7 and T, and of the initial position, (1,..., x), of the system we can 
фу qe e OUS аца = of the equations (1) for study. Moreover, if we study the behaviour of the 
ie., th it moves from various initial positions, we can study the effect of various of the parameters 
E € variables xj. 1,..., Xn) upon it. 
the system of (1) is linear, we can ma 


of the system is then: 


ke these statements more explicit. The general solution 


(3) 


n yt 
х= È aye 
fat 


Wher ۴ ee 
are е 08 x's are measured from their equilibrium ү 
nstant. The system is stable if and only if al 


negati 
ve the real part of A, the more rapidly does the term ] 
wi Bere in order, so that the A with largest real part comes first, and so on, those adjustment processes 
rapid for which all the a’s except the first few in (3) are very small, while those adjustment 
t the last few in (3) are very small.2 


Proce: i h 
5ses will be slow for which all the a’s excep 
ompartmentalization of a general system 


inen This is Y 
into j not, of course, the only way of conce tualizing the с tm д 
esign pendent subsystems, Pros (Um. тт proposal along different lines see W. Ross Ashby, 
lo the ora Brain (1952), particularly Chap. XI, XIV-XVIII, and XXIV. The whole problem is related 
(ed.) псе оп of causal ordering. See Н. A. Simon, *Causal ordering and identifi 
1953 тийе in Econometric Method, Cowles Commission for Resear: 

， 5 In conversati " pu ау classification of 
into A Conversation, Dr. John W. Tukey has pointed out that this three-way classe fication of processes 
volvo Pid (high frequency), * slow E (low. frequency), and * intermediate d (iñtermediate frequency) is in- 
ye every empirical analysis of time series. The over-all period of observation, T, places a lower limit 
T keys ce duencies that can be examined, and the interval of observation, 7 an upper limit. While Dr. 
the Belk comments refer to the oscillatory components of time series, and ours to the exponential components 
Mechanic involved is the same: any given set of observations only pern 
sms at work, and these are the mechanisms which are neither to 


h 5 
relative to 7 


alues, the A's are complex numbers and the a's 
1 the A's have negative real parts. The more 
аде! approach zero. If we now arrange 
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TEST RELIABILITY ESTIMATED BY 
ANALYSIS OF VARIANCE! 


By CYRIL BURT 


Historical Note. The idea of determining the * precision ' of an estimated i ii 

М i ste quantity from a ser! 
A gepented measurements is due originally to Gauss (Theoria Combinationis е ы Т $09). 
NE introduced into psychology by Fechner who proposed a Prüzisionsmass im Gaussschen Sinne, 
first о = 1/0۷2 (Elemente der Psychophysik, 1859, pp. 103 Е). Clarke Wissler was apparently the 
Мол EE that * the precision of a test may be estimated by correlating successive trials * (Psychol. 
E 1901, p. 60). The correlation so obtained has generally been known as a ' reliability co- 
He 2: it seeks to answer the question— if this test were repeated, how far would the results 
ESH The traditional way of raising the question would have been to ask: ‘if this test were 
of ana, how much variation would there be in the results?’ Тһе suggestion that the * reliability * 
the = B and examination marks might be more effectively studied by analysing the yariation—with 
"i id either of factor analysis or of what Fisher has termed the analysis of variance—was, I believe, 
8 put forward ina Memorandum I was asked to prepare for the International Institute Examinations 
Sites te in 1934; anda number of theoretical investigations with these newer techniques have 
€ been published.4 Their adoption for practical purposes has, until the last few years, been 


comparatively rare. 
Definitions. Both in this Journal and elsewhere several writers have recently complained 
term reliability in 


d ib in spite of the vast literature that is available, psychologists still use the t 
ан loose and ambiguous fashion '.5 What kind of ‘agreement’ or * variation ' js to be regarded 
a manifestation of ‘ reliability’ or the reverse? We must therefore begin by defining a little 


More exactly what it is we are seeking to assess. The implicit notion is that of ‘error’, which is 


1 Editorial Note. This paper is intended to form one of a series of elementary © expository notes’ 
The formulation of the problem and 


Which the Journal Committee has suggested should be published. г 1 
o rigorous as would be desirable in an 

0 th 5 ht appear technical or even pedantic 
€ ordinary reader I have, so far as possible, relegated to footnotes. The greater part is taken from 

Т. of Variance (1943) referred to by 
- Mahmoud in the paper that follows. These notes in turn were amplifications of the Memoranda 
see Marks of ‘Examiners, 1936, and The 


s from the fact that in psychology most of the early 
ера At the turn of the last eu when Sum writers 

n to substitute Teutonic terms for French, Zuverlässigkeit win proposed as a synonym for Präzision; 
(uas, for example. PERSA by Krüger and Spearman to designate Wissler's * test-retest correlation " 
is Я is English writings Spearman used the ordinary English 


itsch. f. Psy hi: 
ia f. Psychol., XLIV, 1906, pp. 43)... In Ris ENG and Thorndike use the пе ОП reliability" in 


Statisti ly aided by the work алаад лсе оу юе їп d 
Astical Departi i ity College, articularly Professor К. А, isher, Dr. P. O. Johnson, an 
Sd eyman : Ein. Bea festes for teaching purposes I was much indebted to various research 
ents, particularly to Miss M. Howard and Mr. К. W. B. Jackson. ] ж 
So f. more particularly P. О. Johnson, * Tests of Certain Linear Hypotheses and their Application to 
abili, Educational Problems ’, Statistical Research Memoirs, I (1936), 57-93, К. W. B. Jackson, The Reli- 
Testo ої Mental Tests’ ms» Sb cho; ХХІХ, 267-87, and G- A Ferguson, The Reliability of Mental 
941) and refs. — ' S Р 
ROR E ‹ liability of Repeated Measurements’, Bull. Brit. Psychol. Soc. 
No. 23 (19545 E Es enbe LM ETE б, id. A. Heim, The Appraisal of Intelligence, (1954), 67 f. 
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itself a somewhat elusive concept. For statistical purposes a working definition may be derived 
as follows.! 

In psychological research, the investigator continually requires to measure as accurately as 
possible certain aspects or types of behaviour (which we will call * traits”) manifested by certain 
individuals. The most elementary observation that he can make, therefore, will consist in using (i) 
some experimental device, which we call a fest, to elicit and assess the specified trait (ii) in some 
particular individual (iii) on some particular occasion. In any such experiment, the particular 
test, the particular person, and the particular occasion will each be specimens selected from a whole 
universe or population of such tests, persons, and occasions. We are thus concerned with variations 
in three distinct dimensions (see Fig. 1). 

Accordingly, let xj, denote the observed measurement actually obtained for the ith individual 
with the jth test on the kth occasion (i = 1,..., N; j= ly... 2; А = 1,..., m); let £j be the ° true, 
measurement, i.e., the measurement we should obtain if we possessed a perfectly efficient * test °? ; 
and let ej; denote the error of the measurement actually obtained. Then, adopting algebraic 
addition as the simplest way of relating these quantities, we may write 


Хук = ijt ijk ОГ Eijk = хук—©йў, (i) 


and we may take the latter equation as a formal definition for the ‘error’ of a single 


observation. 

This formulation involves two unknowns, but only one equation. To obtain equations that are 
soluble, the most obvious procedure would be (i) to secure measurements for the same person with the 
same test on a number of different occasions. In practice this may not always be feasible. Often 
it would be far more convenient (ii) to obtain measurements from the same person on the same occa- 
sion with a number of different ° rests’ (e.g., different problems or * items ' in a composite group test), 
or (iii) to obtain measurements on the same occasion with the same test from a number of different 
persons, or finally (iv) to use some combination of these various alternatives. We could then extract 
an estimate, not for the amount of each error (є;), but for the general tendency to error : e.g., with the 


first procedure 
| Kem 1 k=m 4 
ej-— У | ху) or se= | У (ху )21, 
т «=1 Un—1 kel 


where m as before denotes the number of occasions on which the test is applied. > 
Types of Error. Following Gauss and Fechner, psychologists commonly distinguish two main 

kinds of error : (i) systematic or * constant’ errors, which'are more or less predictable and so avoid- 

able; (ii) chance or ‘ random’ errors, which are unpredictable and unavoidable, Неге we shall be 


1 As Dr. Ehrenberg points out, most definitions involve a number of dubious assumptions (ac td 
1 
f. 


р. 41): some of them indeed I myself believe to be unnecessary. However, Dr. Ehrenberg's cri 
of Professor Gulliksen's assumptions in The Theory of Mental Tests (1950) appear unduly severe : but ct- 
Dr. Guttman's review and Dr. Gulliksen's reply (Psychometrika, XVIII, 123-34). it 

? On the whole this simple definition of a ‘true measurement’ seems to me the best : I have phrased s 
so as to cover ‘ true measurements ’ that are purely hypothetical quantities as well as those (if any) which ar. 
actual quantities, e.g., Tom's innate general ability (which by hypothesis is unchanging) as well as, вау, 
Tom's visual acuity or his efficiency in arithmetic. But the popular notion that this or that characteristic ha: 
in fact a * true ’ measurement, if we could only find a perfect instrument for discovering it, is to ту n is 
a highly dubious * metaphysical’ concept (* metaphysical’ in the pejorative sense in which the OP BS 
used by the logical positivists). In psychology more particularly almost every specifiable trait is an Unsta int. 
fluctuating characteristic. Its measurement involves the determination of a region rather than а роте 
We can, if we like, define the particular point which, as it were, forms the centre of gravity about which t y^ 
momentary values appear to vary. But these internal variations do not form part of the * unreliability 
of our test or other metrical technique. 

To circumvent what I have called the metaphysical difficulty, it is tempting to fall back on 
of operational definition : the * true ' measurement we may say is merely * the limiting value tow: 
the mean of a number of measurements progressively tends as the number is indefinitely increase 
(in more modern terminology) * the mathematical expectation over the whole universe of trials ’ : this V 

к=т А 
be equivalent to defining it by some such equation as ё=&(х)= i У x; (т о). But there are pitfalls 
m 9 


ind 
some kin 

ch 
ards uo 


ould 


in this definition. In particular, we shall be liable to include in the * error’ the variations due to change? id 
the trait itself as well as those due to the imperfections of the method of measurement. Hence І show 
prefer to treat this equation as an inference from the definition I have givenabove. For the general appro? in 
adopted in this section see F. N. David, Probability Theory for Statistical Methods (1949) chap. ^ * ch 
accordance with the continental practice followed by Dr. David and others, I use & rather than Æ for 5 

m 
operations as p^ >- 

k 
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concern <clusi i 

prd [A duc eR mein Lenin кү" ا‎ effect is usually described as one that 

PT d yl ber small independent causes, about which nothing i 

and the Рр се Tshonld prefer to follow Keynes, and сало em 

severdi f ‘independence’ ”in terms of ‘irrelevance’. From iti 
important corollaries can then readily be deduced. If the error e ia Basile A 


t H \ 
he sense defined, then as тт increases, 


TES 7 - 
(i) fo ,24 — 0, and hence? (ii) £x = x > ё; 


(iii) гое = (екеу) — 0; similarly, (iv) rge — 0; 


and (v) with mea: i i i i 
M asurements of the kind with which the psycholo h: istributi 
Res i i ologist has to deal, the distribution of t 
Ee tend to the normal. With suitably defined conditions these corollaries will also hold w 
they wee the expectations refer to data obtained on a number of different occasions, but also when 
er to a number of different persons; and most of them can be subjected to an empirical 


check, 

sec н of Reliability. (a) In Terms of Variation. Since we frequently need to com- 

Белер reliability of different tests, we cannot leave the measurements of error in units specific 

popula test or trait. The time-honoured device is to take as unit the standard deviation of the general 

variam tion, or (by preference, since it is additive) the square of the standard deviation, i.e., the 
ance. Дав can be estimated by choosing а random sample of the population, and calculating 

; d 
Xxx). On comparing the two variances it would then seem possible, on intuitive 


N—1* 
he measurements for a single individual (say the ith, 


i 
T А " 

Brounds, to infer that, when the variance of tl 
or the entire sample of different individuals (s), the test used will 


5 
m becomes as large as the variance fe 
e acii ч а 
as оо practical value whatsoever : for the whole object of such a test is to distinguish the ability 
of the sured for any given individual from the abilities of the rest. On the other hand, if the variance 
measurements for a single individual were zero, we should pronounce our method of measure- 
be measured by the ratio 


m : haer E 
OF thee be entirely free from error. The unreliability of the test will then 
individual variance to the total variance treated as unity. And for any given test j we may 


Calculate the quantity 


S 一 


г, = 1— Sesso 


Whi 2 n 2 a 2 А ; е 
hich will evidently be 0 when So, = 3o and 1 when se, = 0; and we may regard this as estimating on 
ES aed 0 to 1 the precision or reliability of the test. It will be seen that this approach treats 
e ility as an invariant property of the test (where * test * means not merely the test-material but 
p process of measuring the specified trait for the specified population). 

it will faras the assumptions we have made hold good with the sample of persons actually measured, 
and dU" a matter of indifference whether our sample of trials is obtained by repeated testing of one 
e same typical individual or by testing а number of different individuals 3; for, by corollary 

cf. also M. G. Kendall, Advanced Theory of Statistics, 

In the notation adopted by Keynes, X, is said to be ir- 
of Х on the evidence Х\й is the same as its prob- 
(X variable X is a member of the class C’, and let 
$(X) mean * X has the characteristic or the value $ ' ; dom ' member of the class C, 
= obability Ф(Х)/С(Х).!, or (better perhaps) if Ф(а)/С(а). h 
Ths ^j iti RA EY [on 
1€ "theorem i an i/h ani (o/h are 
Said to be рт ог Independen са indepen fined in terms of statistical frequency 


of X: 
hithermore, £X, X2 = (EXE X2) oF 3i ХХ А 

Provides a useful test of independence + unless the covari: 

t justifiably assume tha! 


the text I have om! 

the exp! 
f imply that the sample 
hich can nearly alway’ 


t Xı and Аз are statistically independent. 
tted certain q 
ectation of the sample mean is 
mean converges 


2 > 
To avoid complicating the arg 
s be assumed in 


Ti&orous st d 
9 atement wi ге; CE 1 
in p to the quedes 104, n. 2), this would not of itsel 
prae iobability to the population mean : still, under conditions w! 
ice for data of the type here considered, it will converge. И ; 

Тез р this point, I think, my treatment diverges from that adopted by Dr. Guttman ( Basis for Analysing 
Dr, Gt Reliability ’ Psychometrika, X, 945, pp. 255-82—an article which every student s ould study). 
+ Guttman restricts error variance to the variance of measurements for the ith individual, taken about the 
105 


even supposing that 
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(iv), there should be no relation between the size of the errors and the size of the true measurements. 
But in practice, if there were a possibility of some such relation, it would plainly be safer to draw our 
sample of errors from the whole range over which the test is to be applied. On these grounds, as 
well as those of ordinary convenience, it will be better to measure all the individuals at least twice, 
and then substitute the mean of the error variances in the numerator and the variance of the observed 
means for the different individuals in the denominator thus obtaining an alternative formula : 


ga = 4 s x . c O 


. , Now, even if the test had no diagnostic value, the means of the measurements for the different 
individuals would not be exactly the same. We should in fact expect their variance to be approxi- 
mately equal to the error variance. Hence, when the variance of the observed means exceeds the 
error variance, we may reasonably attribute this excess to differences in the true measurements : 
in fact, it follows algebraically from eqn. (i) that, if (as we have assumed) the true measurements 
and the errors are uncorrelated, sz = sgt Se. Accordingly, we may estimate the variance of the * true И 
measurements from the difference between the two. Thus, in practice, the formulae just reached are 
equivalent to saying that the reliability of the test can be defined as that proportion of the total 
variation observed which is attributable to the variation of the true measurements: 1.е., 
2 2 
Sr 一 Se 


. aii 


where the circumflex accent denotes that the variance of £ is merely an estimate. 

(b) In Terms of Agreement. If the differences between the measurements obtained for one and 
the same quantity provide an indication of their unreliability, it is natural to treat their agreement 
as an indication of reliability. At its simplest this agreement can be assessed by computing the cor- 
relation between the two series of measurements obtained with identical forms of a given test (j andj’), 
applied under identical conditions, namely, 


2 


= = N). 
V xy zx 


xx 


Now Уху xy X Ete eitey) = E E+ tiet S ey +E «jey, and this (by corollaries 
ii and iii above) tends to J ¢ ; as № increases. Similarly X xj PX ES PI and X xy? 

A 2 Р РЧ 一 
» &+ 3 ej. Moreover, since by hypothesis the two forms of the test are identical, Х= 
= ae Hence, under the conditions assumed,! we may write 

seg 
m — M D 2 ча ede 
ха+5а о? 


xx = 


or, substituting estimates, 


2 2 2 
d Sx 一 Se Se (у) 
== icem. Р 5 s : Ш 
Sy D 
In passing we may note that, since 
2 2 
Оо, O; 
Fxx'— = апі g et 2 Tey, 
ox ох ogo, V(2E Xx) 4 


we may take 4/rxx' as an estimate of réx, the ‘ index of reliability’ (Kelley, Statistical Method, 12a 


p. 206, eqn. 160). Hence, unless the ‘criterion’ (i.e., the independent estimates obtained fOr 


mathematical expectation and over * an indefinitel: i ials " i i of the 
8 : d y large universe of trials’. This gets rid of many ^, ical 
pumpilon dsl dore d tied , Now, Rone an economy of uc ойор, is "estable in Hee p 
ems in general wis i 8 veri 
extent they are fulfilled. 8 iser to retain them, and so far as possible, to ver y а 
1 For eqn. iv to be valid a less stringent requirement wi impii есе55й. 
i foni оша serve. As the proof implies, the n С. 
and sufficient condition is merely that the two columns of measurements to be comalated Шо Tiomoscedast! 
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trait) is itself as unreliable as th it is qui i : 
validity Pope ees хаз thie it i еа erroneous to state, as is so often done, that ° the 
E Equation iv expresses in its simplest form a mode of approach first suggested by Galton, namely, 
e re bee of variation into independent components or * factors’: if the total variance, с 2 can 
divided into two independent and additive portions, one due to the factor common ja 
тава ue other due to the factors specific to either variable (g2), then the eer peras 
AE total Roh Pus of measurements can be expressed by the ratio of the common factor variance 
Bras to the fact that the non-statistical psychologist always finds it easier to think in terms of 
ot ions, he has, as a rule, nearly always preferred to compute a kind of * self-correlation ° by 
рөн the familiar product-moment formula. But the expression we have reached (eqn. v) 
Шы Rr die same value can be got by adopting an analysis of variance instead of the more 
ihe е ше. Such ап analysis will have several obvious advantages : it will enable us to check 
крче ш assumptions, to obtain the same coefficient by a more direct calculation, to construct 
АА orms of the coefficient where required, to adapt the concept to cases where the test has been 
Кеч {07 y once or generalize it to cases where the test has been applied more than twice, and to 
trial e treatment to investigations in which all three sources of variation—persons, tests, and 
s—and even others as well, can be compared simultaneously. 


I. THE RELIABILITY OF A SIMPLE TEST 


iet TMustratiye Example. The procedure proposed can be most simply explained if we start with an 
d ual example. Let us suppose that, by means of a booklet comprising five sets of questions, we 
ve measured the intelligence of six pupils, chosen at random, and have applied the test on two suc- 

he trials are set out in Table I. The 


cessive occasions. The detailed marks 3 obtained at each of t 
which give the averages for the five subtests, will 


no Tows of figures near the bottom of the table, у 
of n the final marks 4 for the first and second trial respectively. The reader may, if he likes, think 
shall f figures as * intelligence quotients *. To deal with the problem in its simplest form first, we 
Eon or the moment ignore the data for the constituent subtests, and merely ask what is the reliability 

he composite test, judged by the ‘final marks ". 

бе identical tests, or two equivalent (that is, interchangeable) forms of the same test, have been 
а ог the two trials, we may expect that both the means and the standard deviations will be 
exproximately equal on each Occasion. Since the occasions are themselves samples, we cannot 
ре exact equality ; but what small or non-significant discrepancies there may be can be treated 
ps àn incidental result of the unreliability. Accordingly, the best estimate of the mean for the entire 
Coe will be the mean of the two averages obtained for the whole group оп the two occasions 
B .l. Similarly the best estimate for the standard deviation will be derived from the average ofthe 
em Square-sums taken about this mean. The coefficient of correlation obtained from estimates so 
mputed is termed the intra-class correlation ; and, as I have argued elsewhere, it appears to be the 
d on 2N—1 instead 


a suitable form to adopt for this purpose. It has the merit of being base: 7 
— 1 degrees of freedom, and thus provides a more accurate estimate than the ordinary product 
the formula will be (R. A. Fisher, Statistical Methods 


pus Coefficient. On this basis, therefore, 
esearch Workers, 1935, p. 199) ү 
ME m e 
int = 3XG-3»M 07—03) 
* they must therefore be 


ir tests have been shown to be valid, 
Some of Dr. Eysenck's tests, for example, have 
buta test with a theoreti- 


. 1 Several wri i 
n iters have argued that, if the 
[oor Cf. Н. J. Eysenck, this Journal, УТ, 1953, pp. 44), Some ol Dr liy 
sical vali " admittedly a high figure for tests ОЇ per? Pb 
fal validity of 0-70 Ee 0 reliability of БАЕ 0-702 = 0-49, which is not a satisfactory value for a re- 

Ability coefficient, NN i 
А is instructi e the ratio here reache with the expression 10 n 
ca drguments асо сонр theory. То take the simplest case, let the * true message ' to be communi- 
€d be denoted by ё and the obscuring ‘noise’ by e and let the message as тесер be denoted by 
o; +0; 
ured by +1082 t т 5, The formula 
с 


for‘ information communicated * 


the information gained can be meas! 
є 

inane i iplicity of variables and fact 
generali mplex cases where a multiplicity of yana es and common factors 
ized to mentho po fies 1948, and Burt, this Journal, ТУ, 1951, pp. 193 f.). In my fuller 
оге satisfactory derivation of the formulae for reliability could be 


E 9 
ê +e: ‘then, as Wiener proves, 


d in terms of in! 


adoption here. 
A UO Га Mather, ‘Statistical Analysis (1943), Bi 73, Table 12. The 
les give values to one 


IS 
line of approach seemed too tec 
t (р. 111). To save space the tal 


тазе detailed figures are taken from 
Son for the choice will be explained in a momen 
based on two or more. 


ес! 
imal place only: the calculations are 
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where x denotes the first set of N measurements and x’ the second set, and x denotes the mean of the 
entire series. The formula, it will be seen, departs from the more familiar version by substituting 
the arithmetic mean of the two variances for the geometric. 

Applying it to the average marks obtained by the several persons at the two successive trials 
(the * final marks ’ shown in the last two lines but one of Table 1), we obtain 


"X 1052-85 
int 113945-81--2436-84} 


With the same data the ordinary inter-class correlation would be r = 0-5288. The discrepancy, 
here unusually large, is due mainly to the rather big difference between the means for the two trials. 

The Coefficient as a Ratio of Variances. The same value can be reached by calculating the three 
appropriate variances from the square-sums (i) of the observed values, (ii) of the means for the several 
persons, and (iii) of the deviations about these means. 


一 0:3299. 


TABLE I. UNSTANDARDIZED MEASUREMENTS 


Test Occasion John James Hugh George Ralph Henry Average 
M 1 81:0 1466 823 119-8 98-9 86:9 102-58 
M 2 80: 1004 103-1 98-9 66:4 67-7 86:20 
м Av. | 808 123-5 927 1093 82-6 73 94-39 
S 1 1054 1420 713 121-4 89-0 77-1 102-03 
5 2 823 1155 105-1 61-9 49-9 66-7 80:23 
S Ау. 93:8 1287 91:2 91-6 69-4 71:9 91-13 
V 1 119-7 150-7 78:4 1240 69-1 78-9 103-47 
у 2 804 1122 1165 96:2 96:7 674 94-90 
V Av. 100-0 1314 974 1104 82:9 73-1 9919 — 
m 1 109-7 191-5 131-3 140-8 89-3 101-8 127-40 
T 2 872 1477 1399 1255 61-9 918 | 10900 
T (Av. 98-4 169-6 — 1356 133-1 75-6 96-8 118-20 
Р 1 98-3 145-7 896 1248 1041 960 | 109-75 
P 2 842 1081 129-6 75-7 80-3 941 95°33 .— 
P Av. 91-2 1269 1096 1002 92:2 95-0 102.54 — 
Final Marks | Av. 1 10-8 — 1553 9-8 1262 90-1 88-1 109-05 
Final Marks | Ау.2 830 1168 — 1188 91-6 71-0 77-5 93-13 — 
Final Av. 924 1360 10535 1089 80-6 82-8 101-09 


Expressing the average marks (near bottom of Table I) as deviations about the final mean, we 
obtain the following figures : 
Square- " 
Sms Divisor Variance 
a fAv.1 + 17， 十 54.2， 9-3, 4251, —110, —13-0 9 
¢) es 2 18-1, 415-7, +177, — 9-5, 301, Be} 63827 12 531 
Gi) Means — 82, +349, +42, + 78, —20-5, —183 21221 6 3537 
And the deviations from these means will be 
E Av. 1 十 99, +193, —135, +17 . : 
Gi) {Rr Av.2 99. 5193; XH uda + з} mns qa 19 
Dividing the square-sums by the number of items summed, we obtain for the total variance 531% 


for the variance of the means 353-7, for the variance of the individual deviations 178-2. Ha ilar 
test no diagnostic value, we should expect the observed means to show a random variance, Sim 
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to the variance of the indivi viatioi We may 353-7 —178:2 = 175-5 as our 

i, individual deviati 

ti t 1 lations. e may therefore take s i B 

estimate of the variance of the * true’ measurements ; and thus obtain for mE estimate of the reli- 
f the reli: 


ability (in accordance with equation (iv)) 


9; 175-5 
йч ы «а Зы 
deg 7 5179 0-3299, 


the 
Sime fenm as we reached by the method of correlation. 
lysis of Pooled Variance. For practical computation the foregoing procedure is needlessly 


cumb ; = 
ersome. By far the quickest method will be to compute, direct from the observed values in the 


usual way, the i 
5 square-sums needed for a sim 1 lysis of vari is wi i 
enable à ms led. imple analysis of variance.! This will at the 
5 to test the statistical significance of the coefficient so computed. The DP Кун 
as it were, latent in both sets of measurements 


is sh i 2 
(hat [oe ш Даре П. Note that, since the means are, 
e first trial and that for the second), the sum of square-sums * between means ' must be 


entered as 2x 2122-1. 


TABLE II. ANALYSIS OF VARIANCE 


for Two Applications of a Composite Test 


Source of Variation Sum of Degrees of Mean Variance Prob- 
Squares Freedom Square Ratio ability 
Vp = 8488 2:38 «0:05 


= 42442 | (М—1)= 5 
6 | p= 3564 


Vs = 1205-2 


B 
vite means for persons Qn 
measurements for persons | О, = 2138-5 N(m—1) — 


Sum Qs = 6382-7 |(Мт—1) = 11 


of us value for the intraclass correlation can now be obtained from the ratio of the difference 
О component square-sums to their sum.2 We have in fact 
‚ү Qp— Or 4244.2—2138:5 _ 2105-7 
biased) /'; k - = 0:3299. 
(biased) rin = GFO, 42442421385 63827 Qoa 


of fact, however, the formulae so far give! 
biased, and in general too low. The rea 
he three variances, I divided by * the number of items summed '. 
we obtain the * mean squares ”by dividing 
n correct for the bias 
fference and sum of 


n for the required cor- 


Unbiased Estimate. In point der, indeed, h 
er, indeed, may have 


rel 2 
noticed ged an estimate which is 
ith Епа moment ago, to obtain t 4 1 

y the * dea s method of carrying out analysis of variance, We tain 
Y usin, eder of freedom”. And, in accordance with this principle, we сап. 
the 15118 the difference and sum of the estimated mean-square in place of the di 


Square-sums. We then have 
Vp—Vr 848:8—356:4 _ 4924 . 0-4086 
5 Я 


(unbiased = yy, — 8488-3564 = 1205- 


The А 

Correction is here rather large; with bigger samples it is usually much smaller. 

thus found, we calculate a variance-ratio 1n the usual way 
i esults of our test, we should 


To ti Had 
(2.39! test the significange of the value 
m led i Ud it will be жеп that if we had no Other Me th us wo E 
sinifcanenferthat the differences between the means fat nti that (he “mea = 

Tsong 1, This inference, however, de ends on the aain imate for the error of measurement, 
availa a for making such an esti 


uti» a д 
tin the orgen in the table, provides the best 
M ginal table of detailed marks we have for iy discover, 
of * constant err 


lenis t much 


Weh 
a 
of not 
What ;. Yet used ; е h as 
thar Bat is hete 500 and this further information, ature 
oe qe cluded in the estimate ЕУ ^is really variations due to peculiarities of the s 
ly of predi i w far these irrelevant variations 


fore now consider Pall the detailed marks. 


ons about the final mean, it 
s: this gives us 
20, which we 


lest, ran 
can pad of the етог > and consists large 
© iso lated different occasions. Let us there 
or allowed for by undertaking а СО! 
deviati 
eem is then sum the square: 


828 ever, th Iready be 
8-6 quickest 77» ће observed values have alres y betî ferences, and 
thee 4 and 491 ust to square (a) their sums and (b) their difera ice usual way, Les 20р and 
309011 in a exactly twice the square-Sun in thi aragraph- 
a 560 Facto ormula analogous to that given in t жар А 
ее Fisher of the Mind, p. 275, eam. + putt 
er, loc. cit., p. 213, and Factors of the 
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II. THE RELIABILITY OF A COMPOSITE TEST 


A. UNSTANDARDIZED MARKS 


The Three-Way Analysis of Variance. When a composite test or battery has been applied to a 
group of testees on two or more occasions, then, as we have seen, we are really faced with three ! 
main sources of variation : (1) the differences between the persons tested, (2) the differences between 
the tests employed, and (3) the differences in the conditions obtaining on various occasions when the 
tests are applied. Moreover, each of these main causes may interact with one another, so that, in all, 


+ 


A 
| 
| 


Р <--~_ 


Fic. 1 一 Analysis of Variance with Three-Way Classification. 
' Main Dimensions of Variation and Interactions. 


n sources: 


the complete analysis will include seven possible sources of variation, namely, three mai timate 
d est! 


Дон interactions, and one second-order interaction which will be used as ап 


tion : 
Snglish 


: H г у 1 the investigation i ]ker and - 
in the introduction to their chapter on * Analysis OE Variation is reproduced БУН. Me Classification 
jdanc® cal 


:OD Hon 
1 р riatio 
hree axes of co-ordinates the three main sources. of а 


and occasions); the three coordinate planes the 
teractions or*error'. Cf. Figure 1. 
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Our first task therefore i 
Jur f ore is to analyse the total vari 
€ 1 ariance (o; i 
will жыз ысы from these. various sources. Em ekle nissim p pete aes а 
Proofs involved and ie Ped ed in moss textbooks ofstatistics.! То save араас vo 
Eee ce RUE, iled working instructions, the example I have chos Et 
EO OUR AE ready fully analysed in what is perhaps the most illuminati sen. (Table D consists 
УЛЕА T р luminating discussion of the subj 
step WI . The student who wishes to foll i Sue 
р will find them fully set out in the reference given.? The dee ane арз abl I 2 
able III. 


TABLE Ш. ANALYSIS OF VARIANCE: UNSTANDARDIZED MEASUREMENTS 


Source of Variation Sum of D 
egrees of | Mean i ili 
Squares Freedom Square bw Probability 
Marn, Effects 
. Persons (Qp) 21220- 
E p) 21220-9 5 42442 30- 1 
a Tess : (Q) 53100 4 1327-5 95 20001 
. аѕіопѕ (Qo) 37985 1 3798-5 273 «0-001 
днае Interactions 
. Persons and Tests (Ом) 4433-0 20 221-7 ч 200-0: 
ГО: Personsand Occasions | (Оро) 6893-9 5 1378-8 55 9 осоо 
. Теѕіѕ апа Оссаѕіопѕ (Ою) 2918 4 73:0 — 一 
баши Order Interaction 一 = 
O. Residual (Омо) 2784-2 20 139-2 
Sum (Qs) 44732-3 59 


пене ИЛ рост of Variation. There are three preliminary problems to be considered : what evi- 
às a whole (ij) in afford for differences (i) in the general ability of the persons as tested by the battery 
Which the te ii) in the severity or difficulty of the.several subtests, and (iii) in the conditions under 

n e tests are applied on this occasion or that? Theanswers are clear. The effects of all three 


p uem of variation are fully significant. 
and ED three variance-ratios, that due to differences between the * persons * tested is the largest, 
designed pres to be fully significant. These of course are the differences which our tests were 
investigatio measure ; and it is therefore encouraging to find that, even with so small а sample, the 

The ion yields results that are statistically significant. 

data we variance ratio due to differences between the two occasions is almost as large. With actual 
Possibl Misit infer that the difference was due to the fact that the second battery is slightly harder, or 
Some c О some unfavourable circumstance affecting the conditions of the second trial or again to 
ange in the pupils themselves : had the second mean been higher, we should doubtless have 
d improved their general performance; as it is, we might 


argu d : 
gued that familiarity or practice ha ir gene 
produced a decline in interest and keenness. x 

i maller. Here the differences 


wonder wh 
ether the loss of novelty has 
i € vari, bt far si 
eht be due aia due lo di erences Р OF, bte5 erences in the method of marking. 
r to differences In! Aen that the tests were still capable of considerable 
arded as significant; 


Пап 
1 actu: 
mprovement research such results woul 

sand tests c hardly be the case of the inter- 


bu The Interacti, s 
t the ;."leractions, The interaction between person. nd ое саті; die in the 
i: to be sidual variance an 


acti eracti а 
9 ction b ccasions is full 

therefore Ween t ateen person and a; sout smaller than the re 

o en tests and occasions the Variance turn : 

i Let us therefore glance in 

ion between persons 


507 entirely devoi sior А 
Pasina dents of psychology usualy And the ferm Consider obscure: eraction between g 
tests, To calea figures ms i here deny the pair of marks (one for each trial) obtained by 
4 ai ther, ор. cit., рр. 

D E erit M. G. Kendall, The Advanced Theory of Statistics (1946), Ib PP- a 
à. Lov, 63 f. d by R. A. Fisher (1943), рр, 72-79 (Example 
Foreword by 及 .和 as the results of (65002 the crop- 
ccasions. But to 


2. К.М M. Walker and J. Lev, ор. сії» PP" EON a RS 
ight be descri e У t 
on two pud ined (as stated in 


fer Mathes Нет, Statisti Ор. iology, 

li er’s dat tistical Analysis in iology» 

makety data were ujtural : they me i 

e di > in fact agricu t € T seed, SOWn Я 

the ће pr, ferent soils by using 5 differen! kinds û dent of psychology, 1 e illustration of a three-way 
1 (1, p. 10, Table ГУ). 


тесе PrIocedur SING ? ligi 
“ced re more telligible to tests. 
Ysis of ye Section) cem е results Of г paper in this Journa 
nce with vas give! 
actual tests Was £" 111 
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each boy in each of the tests. As there are 6 boys and 5 tests, we get 30 averages. We then reduce 
these averages to deviation form by columns and by rows : in other words, we subtract first the means 
for the several tests and then the means for the several persons (or vice versa). The final result is the 
* doubly-centred measurement matrix " set out in Table IV. It will be seen that every row and every 
column adds up to zero. On squaring the figures and taking the sum, we find that the square-sum 
for these 30 values is 2216-5. But the original mark sheet contained 60 values ; and there each boy's 
average mark for each test is incorporated in the initial data twice over, once for the first trial and 
once for the second. In fact Table IV only represents half the interaction, namely, that for the first 
trial : the other half, that for the second, would be exactly thesame. Over the whole table, therefore, 
the interaction between tests and boys contributes 2x 2216:5 = 4433-0 to the total square-sum ; 
and this, it will be seen, is the value inserted in Table III above.! 


TABLE IV. INTERACTION BETWEEN PERSONS AND TESTS 


Person John James Hugh George Ralph Henry Total 
Test M 一 5-34 一 5:84 一 5-91 1-15 8-79 145 0-00 
S 10-92 2:67 — 416 一 7:30 = 115 — 0:98 0:00 

у 9-06 — 2:69 — 5:95 3-11 4:25 一 778 0:00 

T —11:55 16:45 13-18 714 — 22:07 — 315 0-00 

P — 3:09 — 10-59 2:84 — 10-10 10:18 10-76 0:00 

E 
Total 0-00 0-00 0-00 0-00 0-00 0-00 0:00 


Sum of squares = 22165; 2х 2216-5 = 4433-0. 


What does such a table mean? Inspection of the figures suggests that both tests and boys tend 
to fall into distinguishable types. James, Hugh, and George appear to do better than the rest at the 
T-test, and worse at the P- and M-tests ; John, Ralph, and Henry, on the other hand, tend to do worse 
at the T- and S-tests, and better at the P- and M-tests : there are, however, marked exceptions. 
This doubly-centred matrix of measurements is in fact equivalent to the table of residuals from which, 
in an ordinary factor analysis, the bipolar factors would be derived after the general factor had been 
removed (though in practice such factors would be calculated from the residual correlations, not 
from the residual test measurements). Indeed the only way of deciding how precisely the boys or the 
tests should be classified would be to carry out the appropriate factorization, А doubly centre: 
matrix like that we have just computed must have a rank of either (n—1) or (N— 1), whichever 15 
smaller. Here, therefore, had the sample of persons been large enough to justify such a procedure, 
we could have extracted as many as four bipolar factors.* 


TABLE V. INTERACTION BETWEEN PERSONS AND OCCASIONS 


Person John James Hugh George Ralph Henry | Total 
Ist Trial 1:97 1133 — —21-50 9-30 L56 — 266 | 000 
2nd Trial —197 —1133 2150 — 930 — 156 266 | 0:00 
Total 0-00 0-00 0-00 0-00 0-00 0-00 | 0:00 


Sum of squares = 1378-8; 5x 1378-8 = 6893-9. 


The interaction between persons and occasions is fully significant. The detailed figures are 
shown in Table V. They suggest the possibility that experionce with the tests in the first trial affects 

1 The calculation here described would in practice be a very cumbersome way o ing the square 
sum for the interactions ; it is much quicker to subtract the Mu for the E at the 
end of the computation (see Mather, loc. cit., p. 75 and Table 14А). 

*] have discussed this particular interaction at some length because, as Mr. Mahmoud remarks m 
previous example (this Journal, 1, p. 23) has apparently led certain readers to suppose that only оле elemen jed 
factor was needed to explain each interaction. In that earlier illustration no more happened to c needs 
because, to simplify the exposition, I selected an inquiry in which only two tests were applied on each cjs 
two occasions ; hence all the interaction matrices there had a rank of one. For a detailed factor-analys 
of a doubly-centred measurement matrix, such as that illustrated above, but obtained in a psycho 
research, see Burt, ‘ The Analysis of Temperament’, Brit. J. Med. Psychol., XV, 1938, pp. 166 f., eSP- Та 
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. Occasions 2 i 4 1700) 3798-5 
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with the scheme originally suggested for purposes of a group factor analysis,! arranging all the inter- 
correlations for the same day in square submatrices astride the leading diagonal. When there have 
been only two such applications, the entire correlation matrix will be quartered into four submatrices. 
The coefficients for the first day will then be placed in the N.W. quarter, since these will presumably 
have a group factor of their own ; those for the second day in the S.E. quarter, since they will contain 
a different group factor; while the cross-correlations will be placed in the N.E. and S.W. quadrants, 
since they will involve neither group factor, but only the common factor (and perhaps specifics 
in the diagonal). The next step is to sum values in each of the four matrices ; and, if a group factor 
analysis is to be carried out, the subtotals for the rows and columns will be required as well. We can 
then calculate what Pearson would have called the * bimultiple correlation ' between the two sets of 
tests, with or without differential weighting. The use of differential weights we can defer until we 
consider the application of factor analysis. Mr. Mahmoud gives an excellent illustration of the 
procedure in the paper that follows (cf. Table II, pp. 121 f.). To save space, therefore, I shall 
here omit the detailed table of correlations and give only the final results. As will be seen from 
Mr. Mahmoud's example, once the correlation table has been constructed, we have merely to take 
the sum of the figures in the N.E. (or S.W.) quarter for the numerator, and the mean of the sums in 
the N.W. and the S.E. quarters for the denominator. With the present data, if we take the geometric 
mean of the two last sums, we obtain the ordinary intercorrelation 


e 9-8041 
= 722-0912 x 18-9680) 


if we take the arithmetic mean, we obtain the intra-class correlation 


= 0-4776. 


r = 0:4789 ; 


18-9680) 


Fint = 
E 


The difference is now almost negligible. 4 

Analysis of Variance. We shall obtain exactly the same figure for the intra-class correlation, 
if wecarry out detailed analysis of variance along the same lines as before. Our method of standard- 
ization has completely eliminated the variance between tests and between occasions, and therefore 
also the interaction between occasions and tests. Hence there now remains only one main source 
of variation (the differences between the mean marks obtained by the several persons—the result 
in which we are chiefly interested) and two possible types of interaction. The effects of these three 
components are shown in Table VII. 


TABLE УП. ANALYSIS OF VARIANCE FOR STANDARDIZED MARKS 


Source of Variation Sum of Degrees of | Меап Variance Probability 
Squares (Q) | Freedom | Square (V) Ratio 
P. Persons 6-0667 5 1:2133 22:39 «0-001 
PT. Persons and Tests "7038 20 +0352 — E 
PO. Persons and Occasions 2:1451 5 -4290 7:91 «0:001 
PTO. Residual 1:0844 20 ۰0542 — жЕ 
S. Total 10-0000 50 | (-2000) 


on between 
riance. 
less 


As before, both the differences between the means for the persons and the interacti 
persons and occasions are fully significant, the former exhibiting much the larger amount of va 
But the interaction between persons and tests has become so small that its mean square is even 
than the mean square for the residuals. and 
We can now calculate the intra-class correlation as before from the square-sums for persons 
for the interaction between pupils and trials. We obtain 
m О»— Оро _ 6:0667—2-1451 
int ° Opt Opo 60667421451 
Since the degrees of freedom are now the same for both square-sums, the previous * correction for 
bias * (substituting mean squares for square-sums) leaves the value for the coefficient unaltered. у ia 
. The Factoria! Analysis of Variance. We have seen that a reliability coefficient is intende of 
indicate the ratio of the estimated variance of the ‘ true’ measurements to the actual variance 
To avoid 4 
* submatri 
io be more 


0.4776. 


1 C. Burt, * Factor Analysis by Submatrices *, Journ. Psychol., VI (1938), р. 349, Table I. 
technical term which is apt to alarm students who are unfamiliar with matrix algebra, each 
whose elements were to be summed was called a * pooling square’. The phrase seems now 
frequently used of the entire correlation matrix as thus partitioned. 
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the observed measurements, i.e., to the * total variance * conceived as the sum of the ‘ true variance ° 
and an ‘error variance". But how do we know that the value taken as the numerator in the ratio 
just calculated really represents the * true variance ' we have in mind, and that it does not incorporate 
Something that we might (if we knew its real nature) also regard as error? 

It is, I think, seldom recognized that an * analysis of variance °, as ordinarily carried out, does not 
really indicate the relative amounts of the total variance attributable to each hypothetical factor 
taken by itself. Its purpose is primarily to test the statistical significance of such factors, not to 
assess their relative importance. For this latter purpose the * square sums ’ and the * mean squares * 
must be subdivided still further. 

.The need for a supplementary analysis will be clearer if we first set out, in the form of a hypo- 
thetical model, our conception of the way the observed measurements are composed. Adopting the 
same style of notation as before, our hypothesis will be that хуу, the observed measurement for the 
ith person obtained with the jth test on the kth occasion, is made up of four additive factors, each of 
which. is normally distributed in the population with a mean of zero, and varies independently of 
variations in the others. Thus, we may write Я 

хук = Yit Ti+ wijt Eijk» 
where the new symbols designate the following factors : 

(i) A general or basic factor, yi, specifying the extent to which the true mean measurement 
for the ith individual deviates from the mean of the population, £ (which we may, if we wish, set equal 


to zero :) i.e., 
yi7 ёё. s > . E . « (УШ 


__, ii) A supplementary factor, Tij, possibly complex, specifying the extent to which the ith in- 
dividual's measurement for the jth type of test deviates from what we should expect in terms of his 
own general mean and of the population mean for that test (p: ie. 


m= yêt : 0 s s 0 09 


es (iii) A supplementary factor, wij, also possibly complex, specifying the extent to which the ith 
individual, tested with all the tests on the kth occasion, would deviate from what we should expect 
in terms of his own general mean and of the population mean for that occasion (£4) : i.e., 


apai tates „ ЕЮ = Бу КОКУ Q 


. ..v) An error factor, є, specifying the extent to which the observed measurement for the ith 
individual, obtained with the jth test on the kth occasion, deviates from what we should expect in 
terms of the three foregoing factors : i.e., 


Eijk = xik—(yi- туі). ° . < . Ск 


From this it will be clear that the observed means and variances cannot, as they stand, be taken 
as unbiased estimators for the corresponding factor measurements, since each of them includes a 


contribution from the other factors, usually in diminishing amount. Thus 
zo de 
xi = pn ZZ Xii 
jk 


jS ead S wet ED Uk nome ш; 
二 do 
yi my y m, jk 


Squaring а ming the several means and their analytic expansions in terms of these factors, 
we readily ара the t ie equations for estimating the variances of the several factors. Then, 
substituting the values given for the variances in Table VII, we reach the estimates shown below 
(Table УШ), where, as before, V; = Op/(N—1), Vpt = Qpil(N 1)(m— 1), Vpo = Qpol(N— 1)0т— 1), 
Vpto = Qpiol(N— 1)(m—1),(1—1), and hence Vs = Qs|(N—1)mn. Аза check, we note that the 
value now obtained for the sum of the several variances should agree with the value for V; given in 


able VII. . EM 
I have deliberately given the results of a mechanical application of the formulae to the present 
data, because it illustrates one or two instructive points. First we observe that an impossible 
estimate is reached for зу. The negative value is due to the fact that we have treated the results of a 
chance fluctuation as representing a separate factor : it implies that, with the data here analysed, 
it was unnecessary to include in the hypothetical model any such factor as ту. This, of course, 
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should have been inferred from the attempt to determine the significance of the interaction between 
persons and tests in Table VIT. But, since several correspondents have been puzzled by the occasional 
appearance of a negative result at this stage, it seemed useful to illustrate how it may arise. 


TABLE VIII. ESTIMATES FOR THE VARIANCES OF THE POSTULATED FACTORS 


Variance Equation Estimate 
si = ур Ин Ино Vpro) +0803 

© = Н (Ир Кро) 一 :0095 

52 - 1 (Vpo— Vpto) 0750 

$ = Vpio -0542 
zs =й -2000 


Secondly, let us apply the foregoing analysis to the interpretation of the ordinary reliability 
coefficient. We now have! 


И-и, 2 52 
РШЕ з یی‎ ЯВ eii 
yt DP n, So) 十 Ез; 
:4051— :0095 


:40154--3750 — -0095+-0542 
= -4776 as before. 


(у) common to all the tests? Or are we intending to secure a more general estimate, which wou 
include the specialized abilities of the separate tests as well?? Іп other words, is т (if significant 


a composite test (or battery) once only. I have discussed this problem more fully elsewhere, 


therefore need say very little here. К 

The relations between this procedure and the older and more familiar method тау be mos: 
conveniently explained in terms of the pooling square. At first sight the idea that the consistency j 
two or more test-applications can be estimated from a single application only seems highly paradoxica™ 
1 Since this section is intended to be of general application, I here give the general form of the equation 
for the unbiased intra-class correlation (see Factors of the Mind, p. 275, eqn. ii). Since in the prese! 
example m = 2, the equation reduces to the forms already used above (pp. 109 and 113). m- 

2 The relations between the two types of component are very similar to the relations between the sou 
mational or centroid factors obtained by * simple summation ’ and the group factors obtained by а “go n- 
factor analysis’: with the former the * general factor ’ includes not only the * basic’ factor but also € 
tributions from the group factors, and similarly with the successive bipolars. This is confirmed by the 
of an actual factor analysis by these two methods : see below, p. 118. ally 

3 For example, when using a battery of verbal and non-verbal intelligence-tests, our aim is genero rg 
to measure intelligence only; when using question-papers in arithmetic, English, and the like to PPF all 
* general educational ability ' in a scholarship examination, our aim generally is to assess a compoun led 
the child's educational attainments. Elsewhere I have tried to distinguish the various types О! SO Са ож» 
j rely coefficient °’, and to give them more distinctive names : (see also Mr. Mahmoud's paper БЕ 

. 127f.). NA 9 

BEA See Burt, ‘The Reliability of Teachers’ Assessments’, Brit. J. Educ. Psychol, XV (1945), esp 
Appendix, Pn. 90-2. 
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Priate statistical erences in the tests, then they must be eliminated (so far as possible) by an appro- 
correlations s formula. Now, if there were no changes and no differences whatever, then the inter- 
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Will, as befo se assumptions we can now construct a suitable pooling square. The N.W. quadrant 
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Quadrant Шоста! application : as usual, it will have units (1:00) in the leading diagonal. The S.E. 
equivalent sere sad the results of an imaginary second application carried out with perfectly 
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quadrant will b e exactly the same as in the N.W. quadrant, and the side correlations in the N.E. 
in the NE ре exactly the same as the side correlations in both the N.W. and the S.E. quadrants : 
correlations9uadrant， however， the leading diagonal will no longer contain units, but * reduced self 
conforming with the assumptions just laid down. The only question is—what values 
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The point is clearly illustrated by the present data, where the two trials were separated by an 
interval of 12 months (not, as our assumptions require, by the minimum time needed to prevent the 
effects of the first trial influencing the second). The sum of the 10 inter-correlations (without the 
self-correlations) obtained at the first trial was 17-09. Accordingly with the first method (inserting 
actual reliabilities for subtests) we obtain 4 


1709-1177 _ 1886 esas 
17-09+-5 1:00 2209 — 
with the second (inserting estimates based on the average intercorrelation) 


17.09--5x-.85 _ 21:34 _ 
17094-5x100 2209 


Reliability determined from Weighted Factors. The factor measurements obtained or implied 
by an analysis of variance consist of the simple sums or means of the observed test-measurements, 
without any differential weighting. As I have argued elsewhere, better estimates of reliability can be 
secured if we employ weighted sums, the appropriate weights being determined by factor analysis. 
Here, for example, the summational and the group factor analysis alike reveal that, on both occasions, 
Test P has a far higher loading for the general or basic factor than Test T; and this in turn greatly 
augments the size of the interaction between tests and persons. 

It is largely for this reason that I have repeatedly urged the need, in any exact inquiry, to base the 
estimates of reliability on an explicit factorial analysis by a summational or group factor method.! 
But other questions frequently arise which only a factor analysis can solve. In an ordinary analysis 
of variance the most suitable form of classification is often a matter of some doubt—a point that is 
commonly overlooked. The ‘degrees of freedom’ can always be subdivided into ‘ orthogonal 
comparisons ’ in many different ways : and, as Mather observes, “ the first task is to partition these 
degrees of freedom into the components that are appropriate to the various comparisons which might 
be interesting ".? But how can we tell which comparisons “ might be interesting "? Shall we 
examine those that represent the special abilities conceivably required for particular groups of tests, 
and if so which groups do we select? Shall we look for those that represent ‘ function-fluctuation 
in the persons, or, it may be, fluctuations in the conditions obtaining at successive trials, which them- 
selves may affect different persons differently? Or again how are we to analyse the interactions 
into their constituent components and assess the importance ofeach? With large groups, or numerous 
subtests, or numerous repetitions, such problems often prove crucial. And nothing but a pre- 
liminary factor analysis can show how they are to be answered or which of the supplementary 
classifications are really worth while. 

Since our initial table of data was based on so small a sample and does not represent any actual 
testing, I need not here reproduce the tables of correlations and factor saturations. Moreover, the 
method has been fully illustrated in previous papers.3 For completeness I will merely summarize 
the final results. Three summational factors and three group factors were extracted. Their con- 
tribution to the total variance (10-00) was as follows : . 

(A) General factor, 6-11 ; first bipolar (classifying occasions), 2:37 ; second bipolar (classifying 
tests), 0:35 (non-significant) ; 

(B) Basic factor, 4-48; group factor for the first occasion, 1-88; group factor for the second 
occasion, 2:33. 

It will be seen that the variances of the general factor and the bipolar factor for occasions stand 
in approximately the same proportions as the crude variances Vp and Vpo in Table VII; while the 
basic factor and the two group-factors for occasions stand in almost exactly the same proportions 
as 5, and 52. This is what, if the foregoing theory is correct, we should expect. So far as it goes, 
these points of agreement confirm the close relation between the principles underlying factor analysis 
and those underlying the analysis of variance. However, in many actual researches, where the diag- 
nostic values and consequently the weights of the several subtests differ widely from one to another 
the agreement no longer obtains; and in such cases a reliability coefficient based on a factoria 
analysis will manifestly be far more accurate and far more informative than one derived by the more 
usual over-simplified procedures. 


966. 


1 Which factorial method is to be preferred will depend on the investigator's conception of the factor 
or factor measurements whose reliability he desires to assess. Thus I should not entirely agree with МГ: 
Mahmoud who appears to hold that the group factor method alone is appropriate for a// such problems. 

. ?20p.cit., p. 73. Kendall observes that the choice is * as a rule determined by some prior classification 
given among the data of the problem ' (Advanced Theory of Statistics, M, 175). But in psychology, insteae 
of being given with the data, the classification has often to be deduced from the data ; and for this purpos 
a factorial analysis may be indispensable. icl 

3 An excellent illustration is provided in this issue by the tables included in Mr. Mahmoud's artic 8: 
If any reader wishes to use the correlational tables relating to the present data for purposes of class-instru' 
tion, copies can be obtained on request. 
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OF FACTOR THEORY 


Bv A. F. MAHMOUD 


The Need for a More Adequate Theory. Ina previous issue of this Journal 1 I raised the question 
of assessing the reliability of composite tests (like the newly constructed battery of intelligence 
tests on which my colleagues and I have been engaged) ; and the Editor in reply drew attention to 
the possibility of using more refined techniques, such as have recently been developed from the familiar 
principles of factor analysis and analysis of variance. In several published researches one or other 
of these newer methods has been tentatively adopted. But the investigators have usually failed to 
extract all the information implicit in their data, either because the methods have not been carried 
far enough, or because they have been applied too mechanically and without an adequate recognition 
of the assumptions or hypotheses on which the procedures are based. Itis the object of this note to 
restate the underlying theory 2a little more simply, and then to show, by an actual example, first the 
value of the supplementary device that Professor Burt has termed the ' factorial analysis of variance ’, 
and secondly the superiority, for this particular problem, of a ‘group factor analysis’ rather than a 
summational or centroid procedure. 

Illustrative Example. For the purposes of the inquiry referred to in my note, eight intelligence 
tests were constructed, and arranged to form two parallel batteries with similar subtests in each. Each 
of the subtests consisted of 20 problems or tasks of increasing difficulty. As they were intended for a 
comparatively illiterate population, all were of a non-verbal or performance type. Test 1 consisted 
of a picture-completion test—four stories each containing five panels in series with an inset to be 
chosen and inserted in each ; Test 2 of geometrical designs to be constructed with coloured blocks, 
similar to Kohs' ; Test 3 of 20 mazes, similar to the Porteus series ; Test 4 of diagrammatic * matrices’ 
similar to Raven's series. А hundred boys were selected as forming a random sample of the popula- 


tion, and were tested with the first battery as nearly as possible on their eleventh birthday and again 
bsent on the second occasion, so 


with the second battery some six months later. Thirteen were abst 1 
that the data analysed here relates to 87 individuals only. In scoring the test, arbitrary marks of 
2 to 5 were awarded for each task correctly performed, the amount for each being determined partly 
by its complexity and partly by the testee’s speed. Taking the unweighted sum of each pupil's marks 
for each subtest as his ° final mark’ for the battery, and accepting the teachers' assessments for in- 
telligence as a provisional criterion, we obtained validity coefficients of 0-56 for the first trial and 0:63 
for the second—an average of 0:59. This is a somewhat disappointing figure. But as we shall see 
later on, better correlations are obtained when the subtests are given different weights ; and, since 
the children were drawn from different schools and the teachers' standards varied appreciably, the 
criterion does not provide very trustworthy estimates of individual differences, and can only be re- 


garded as furnishing a convenient overall check. > кк 
(a) Analysis of Variance. Asa preliminary it seemed advisable to determine (i) whether the sub- 
tests comprising the two batteries showed any appreciable differences either in the general difficulty 
of the tasks or in the general method of marking, and (ii) whether the abilities that we sought to test 
herefore began by making a complete three-way 


showed any appreciable change with time.3 We t 
jance ’, Brit. J. Statist. Psychol., VII (1954), p. 61. 


ae iabili fficient based on Analysis of Variance t. J. Ste „р. 6 
y ТА Кена ebted to Professor Burt for supplying copies of his original жешогадапп on Reliability 
of Tests and Examinations, drawn up for the International Institute Examinations nquiry, and of his der 
Laboratory Notes, and for permitting me to incorporate in my own presentation w hatever extracts I wished. 
His abridged * three-way analysis of variance ' and what he calls ' the factorial analysis of variance ' appear 
to be especially appropriate to problem: like my own; de save for a few unpublished theses and reports, 
di in any large scale research. Tu. Е “К Я 
hi pe Siia Е fe! modifications have been made, chiefly in the interests of simplification : 
his notation, for example, distinguishes between (a) constants (or * parameters °) which characterize the 
population, whether actual or hypothetical, and (6) the corresponding estimates (or * statistics °) which are 
Obtained from the sample. Here, to avoid complicating the discussion too much for the elementary, reader, 
I һауе abandoned this double symbolism. The Statistical reader will easily be able to make such distinctions 
i i i nt. 
for himself where he considers them ішрона ible to distinguish between the two explanations for the 


3 As will be obvious later on, it is scarcely possi К bety 
change in the averages, unless a more complicated experimental design is used. For such a purpose we 


should need at least four groups of children—one pair taking one battery first, the other pair taking the other 
battery first. » 
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analysis of the variance for the raw scores. The working method followed was that recommended 
by Professor Burt in his Notes on The Construction and Standardization of Tests (illustrated in the 
preceding paper). 

In analysing such data there are three main sources of variation to be considered—the persons 
tested (P), the tests employed (T), and the occasions on which the tests were administered (O). 
The crude analysis, based on the raw unstandardized scores, indicated (i) that the differences between 
the means for the several persons were highly significant ; (ii) that the differences between the means 
for the several subtests were non-significant ; (iii) that the differences between the means obtained 
on the two successive occasions were barely significant at the 5 per cent. level (P — 0:047) : there was 
in fact a slight increase in the general average, due, it appeared, not so much to mental development 
or to previous practice as to slight differences in the standardization. 

Accordingly, it seemed reasonable to eliminate these minor differences between tests and occasions 
as irrelevant, and reduce all scores to standard measure. In computing the standard deviations from 
the raw scores, the observed sums of squares were divided by (№ — 1), not by N, The square-sum for 
each subtest will therefore be 86 and the mean zero. А fresh analysis of variance was then carried 
out. The results are shown in Table 1. 

Since we have eliminated all differences between the means for subtests and for trials, and con- 
sequently the interaction. between subtests and trials, we now have to deal with only four 
sources of variation, and the total number of degrees of freedom is reduced to 


(N— 1)тп = (87-1) x 4x2 = 688. 
With the degrees of freedom indicated for the separate sources of variation, all three variance ratios 
appear to be fully significant even at the 0:1 per cent. level. 


TABLE I. ANALYSIS OF VARIANCE 


Source of Variation Sum of Degrees of | Mean Square Variance 
Squares Freedom Ratio 
Persons (P) 480-069 86 5:5822 46:7 
Persons and Subtests (PT) 129-748 258 ۰5029 42 
Persons and Trials (PO) 47:326 86 ۰5503 4:6 
Residual (РТО) 30:857 258 +1196 — 
Total 688-000 688 (1-0000) — 


Note.—The value for each ‘Mean Square’ (in column 3) is obtained by dividing the * Sum of Squares’ by the 
‘Degrees of Freedom’ : hence the figure in the last line is not a * Total’ of those above. 

. Using the formula given by Burt,! we can calculate the correlation between the two batteries 
(i.e., the ordinary reliability coefficient) from the variances in the table, without having to compute 
the product-sums. We obtain 
Vp— Vpo à 
Vp+ Vpo ` : * т : 
5:5822—-5503 
= so کد‎ 82 
5582245503 — 9205. where 

Vp and Vpo denote the mean squares for persons and for the interaction between persons and trials. 

The figure thus obtained is no doubt quite promising for a first attempt with a new set of tests ; 
but (assuming we can accept it as it stands) it is too low for us to consider the battery sufficiently 
reliable for immediate use. Moreover, the analysis of variance reveals a feature which the reliability 
coefficient ignores—namely, the large interaction between persons and tests. This certainly calls 
for further scrutiny. Does it represent one additional and unsuspected factor or several? And does 
its magnitude fairly reflect that of the disturbing influences to which it points? 

Psychologists who have used analysis of variance seem generally to suppose that the variances 
thus assigned to the several ‘sources of variation’ can always be identified, just as they are, with the 
variances of the hypothetical components or ‘ factors’ that we are seeking to determine.2 Аза 

1 Factors of the Mind, p. 275, remembering that К (as there defined, our m) = 2. Cf. Marks of Examiners, 
p.255. [See also article, р. 1097. X 

? Burt's comparative tables for * Analysis of Variance with Standardized Marks’ and * Analysis of 
Variance with Factors obtained by Weighted Summation ’ (this Journal, I, 12 and 25, Tables Vi and XV) 
have perhaps encouraged this view. With the data there tabulated the interaction between persons and tests 
and between persons and occasions could be expressed in terms of a single row vector, analogous to a TOW 
of ‘ factor measurements for a single factor in either case, and the variances were directly compared with 
the variances obtained from a factor analysis of the usual kind. But this was no doubt due to the need for 
a short and simplified exposition. In his fuller analysis of the Examinations Council data he stresses the 
need for what he calls the * further analysis of variance ’ to obtain the purified * factorial variancés ’. 
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quadrant, and the cross-correlations in the N.E. and S.W. quadrants. 


TABLE II. CORRELATIONS BETWEEN TESTS: (POOLING SQUARE) 


| 

Total 
Tests 1 DÀ 3 4 (ist 1 2 3 4 Ba 
Trial) (Trial) 

i , Test 

. Picture [1:000] -897  :625 "713 | 3-235 “881 -768 +604 -637 | 2:890 
2. Block -897 [1000] .541 -584 | 3:002 | -825 837 7539 563 2-764 
3. Maze «625 -541 [1:000] -751 | 2-917 | 445 347 -703 -557 | 2:052 
4. Matrix “713 -584 -751 [1-000] | 3-048 | :559 462 -667 -670 | 2:358 
2414 2:513 2427 | 10-064 


Total (Ist Trial) | 3-235 3-022 2:917 3-048 12-222 | 2-710 
445 559 2-710 |[1-000] ۰74 


۰641 735 | 3:150 


1. Picture .881 825 
2. Block в 837 347 -462 | 2414 | .774 [L000] 572 7704 | 3:050 
3. Maze 404 -539 тоз 667 | 2513 | .641 -572 [1000] 728 | 2941 
4. Matrix 37 563 557 670 | 2427 | 735  -704 728 [L000]| 3167 

3.050 2:941 34167 |12:308 


Total(2nd Trial) | 2:890 2:764 2052 2:358 | 10-064 3-150 
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on between the two entire tests. 
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or, to adopt the notation of the articles just cited, 
AX Rab 
УС Raa 3, Rub) 


Thus, with the present example, the square sums for the two batteries are given by the sums of the 
intercorrelations shown in the N.W. and S.E. quadrants respectively of Table II ; while the product 
sum is given by the sum of the cross correlations in the N.E. (or S.W.) quadrant. 

Hence the correlation between the two batteries will be 


10-064 
"ss J2222x12308) 一 8205. 

It will be noted that this is exactly the same value as was obtained from the analysis of variance by 
using equation (i), An algebraic proof of their equivalence will be given in a moment. Our more 
immediate obiect is to factorize this matrix of correlations in terms of the hypothetical components 
corresponding to those sources of variation which the preceding analysis of variance has shown to be 
significant. А 

Correlation as a Ratio of Variances. Fisher,! when first introducing his method of analysing 
variance, begins by explaining how a coefficient of correlation can be expressed as a ratio of two 
variances, or in other words, as “that fraction of the total variance (A+ B) which is due to the cause 


Tss P " * . (iia) 


(A) which the observations have in соттоп”: i.e., in symbolic form, * p = at^ He then 
А 

points out that, whereas “ the value of B (the * error variance ’ so-called) may be estimated directly 

from the variance within groups ", the value found for А, the variance between groups, includes à 

part contributed by B. The argument that follows may be regarded as a generalization of this analysis 

to the case in which we are concerned with three sources of variation (in addition to ‘error 

variance). 

Hypothetical Model. In carrying out such an analysis, and particularly in applying it to the 
determination of reliability, the most urgent need, in my opinion, is to keep clearly in mind the under- 
lying assumptions on which the procedure is based and to consider how far they hold good for the 
data under examination. Strangely enough, these obvious precautions seem rarely to be observed. 
Let us therefore start by attempting an explicit formulation of the assumptions involved. 

For our present purpose, we may most appropriately begin with the simplest of Burt's * mathe- 
matical models ' (as he terms them). It is based on the provisional hypothesis that any observed 
test-measurement, obtained in the course of an investigation like the present, can be regarded as the 
weighted sum of four kinds 2 of hypothetical component—(i) a factor or factors 3 common to all the 
tests on all the occasions ; (ii) a group factor characteristic of similar specimens of the same test 
(with л kinds of test there may be л such components) ; (iii) a group factor representing those con- 
ditions that may be peculiar to the same occasion : (when the tests are applied on two or more occa- 
sions there may be two or more such components) ; (iv) a specific factor peculiar to each single test 
* error of measurement’: (with z tests and 2 occasions there will be 27 such components). 

., Let us then suppose that we are dealing with a battery of n tests, and that these are applied in 
identical or parallel form on two (or more) different days; and let mj, denote the observed mark ог 
measurement 4 obtained by the ith pupil in the jth test on the kth day @ = 1, 2,..., №; ј = 1, 2,...,п; 
k = 1,2,..., т). Let x denote the hypothetical factor measurements ; and let g, d, t and 'e denote 
the relative weights (or factor saturations) of the four kinds of factor. Let the subscript i denote 
the individual to whom the measurements refer, and the subscript j or j’ the test to which the satura- 


tions refer, the subscript k being added where it is necessary to distinguish days. Then, for any given 
day, we may write d 


mij = gjXgi--ljXi-- djxaj-ejxy — . А E є . (ii) 


1 виа Metodi ar esearch Workers, chap. VII, sect. 40. 

А is is merely a special case of the * four factor theory ' set out in his di i iabilit 
A Mrs of Examiners, ecn gano ORB y out by Burt in his discussion of reliability 
3 [n the algebraic equations to be developed from these assumptions I shall fi i icity E 

h or assum 

that there is only one component common to all the tests, or that, if there are more, hel vais can b 
summed uu as the variance of a single composite component. This simplification is warranted by 
Bumpuand - However, it means that, if there are factors common to two or more tests in the same battery 
but not to all, our present model must treat them as bipolar factors extending over all the tests (as is done 
in an ordinary ced analysis). We could, of course, avoid this by introducing (as Burt does in one of his 
more elaborate models) a fifth kind of factor, namely, * group factors for special abilities ° But I am anxious 

not to obscure the exposition of the main argument by raising numerous minor issues, 

。 lt seems a He inelegant, but not, I think, confusing, to use m (with Subscripts) for ‘marks’ OF 
measurements ad us (without subscripts) for number of occasions or trials. These symbols have been 
adopted in most of the papers, etc., to which I shall here refer. Hence it Seemed best to retain them. 
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Following the usual convention, let us suppose that both the test-measurements and the hypothetical 
actor-measurements have been reduced to unitary standard measure, so that for any given day 


CN 2 
2" = 1, Уха = 1, etc. 


i 
. Assumption 1. In dealing with observable classifications (such as are used in analysis of variance) 
it may often happen that the crude empirical factors (i.e., the principles of classification) are not 
altogether independent of each other. For purposes of analysis, however, we shall assume that 
the hypothetical components are strictly independent, i.e., mutually uncorrelated. Then all such pro- 


Xj У XjiXji etc., will vanish. Hence, for any given day, 


ti 
2 2. Aid Y. 
оў= Хт = git +d; +e; & ë Т š . (iv) 


duct sums as X x,xy, Xx 
s л 


і 


nee the variance of the jth test will be the sum of the variances of its four components. And similarly 
the correlation between two different tests applied on the same day will be 


ry = туту = уай, (v) 
i 
between two similar tests applied on different days will be 
rag Zmymag = gj, Eit fj bjr ^ 5 Р . (vi) 
And between two different tests applied on two different days it will be simply 
(vii) 


by E 
ja Lm ту, —98j jy 


Ua 

The model we have adopted, together with this first assumption, will enable us to express the 
sums derived from the four quadrants of our * pooling square ” (cf. Table II above) in terms of four 
kinds of hypothetical component. Summing the л values for оў апа Tj and substituting the expres- 
sions given in eqns. (iv) to (vii), we obtain the results shown below in Table III (reproduced from Burt). 
It will be observed that, in virtue of the derivation of the factorial expansions for the various coefficients, 
the expressions Эл, ер Улу and 27е, enter only into the diagonals of the N.W. and S.E. quad- 
rants respectively, and the expression JX f, tj’, enters only into the diagonal of the N.E. and S.W. 
quadrants : in other words, these values are regarded as ' specifics’ which merely augment the * self- 
correlations ' in the matrices Raa, Rab, and Rp». To determine the best values for each of the several 
factor saturations, a formal factor analysis by the group procedure will be essential (cf. Table VI 
below, р. 133). Meanwhile, if we introduce two further simplifications, we can reach a plausible 
approximation by means of a direct analysis of variance. 


TABLE Ш. POOLING SQUARE WITH DIFFERENTIAL SATURATIONS 
Sums of Correlations in terms of Hypothetical Saturations 


Summed Correlations 


Factor Saturations 
Eu dua = fy - ey - = - 2 Каа = vem 
Ns ES I E 
Ez da ty = ên (gy HGY +25, Ee, Es, t Zt fh 
SOUS Hd aca dais 2 j ee d - 
УК = 


82 - dà tn - - - en- | ХУ Roa = 
Г mos к= == vem iy Lent Zt te Qs y Q4) Eg ES, 
j j + 


.. ! For a rigorous treatment it 
this refer to the population or to tl 
mentioned above, such a distinction е 


fectly clear whether such conditions as 
he sample. That, in fact, is the procedure followed by Burt. But, as 
ntails a double set of symbols (Greek for the former, Roman for the 
latter) which is likely to bewilder the ordinary reader. It is perhaps sufficient to note that the model itself 
refers primarily to the population. Largely for convenience in subsequent deductions, certain modes of 
factor analysis also assume that the hypothetical factor measurements are uncorrelated in the sample. 

hen the model is applied in any particular research, the investigator should first satisfy himself that the 
Conditions postulated hold good within a reasonable approximation. For purposes of simplified calculation 
the requisite assumptions can then be supposed to be true also of the data contained in the sample. 
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Assumption 2. I shall next assume that corresponding tests in the two batteries may be treated 
as * equivalent’. Burt’s definition of * equivalence ' differs somewhat from that ordinarily put for- 
ward, and is perhaps a little too stringent. For any pair of tests to be regarded as * equivalent 
(he says), (i) their means and (ii) their standard deviations must be the same ; (iii) they must depend 
on the same common factors (whether general and bipolar or basic and group) ; (iv) they must have 
the same weight for the same common factors and the same weight for the error factors (the error 
factors, of course, unlike the common factors, will not be thesame).! From this it would follow that 
the corresponding self-correlations and intercorrelations in the N.W. and S.E. quadrants respectively 
differ only by such small amounts as may be attributable to errors of sampling. Evidently this is 
a corollary whose validity can be tested for any given set of data. In the present research, it will 
be seen, some of the observed intercorrelations do differ significantly ; and these differences indicate 
obvious points at which the batteries require, if possible, to be improved. However, the totals 
for the two quadrants (12:222 and 12-308) differ hardly at all; and that may perhaps suffice to 
justify a provisional acceptance. Я 

... With this assumption the best estimate for the common standard deviation of the two batteries 
will be obtained by taking the arithmetic mean of the variances of both ; and accordingly, as Burt 
suggests, we may now use the formula for the intra-class correlation (instead of the more familiar 
product-moment formula) together with the scheme for analysing variance associated with this 
coefficient.? In place of eqn. (ii a), therefore, we shall for the future substitute 


У Rab 
~ ACE Raat J Rib) 


With the present data, the two sums, 37 Raa and Y Rob, are so nearly equal that the substitution of the 
arithmetic mean for the geometric leaves the value for rss’ (:8205 as given above) virtually unchanged. 

Assumption 3. The summations carried out in an analysis of variance give equal weights to 
each component of the same type: in effect, the method assumes that gi; = g2 = ... = g (say), 
and dıı = dai =... = d (say), and so on, where g, d, etc., denote the average saturations. It is, 
of course, a familiar principle that, in summing a number of test-measurements, unless the differences 
between the weights are large, the substitution of equal weights has little effect on the relative values 
of the sums obtained (see this Journal, III, p. 111). Moreover, in constructing two parallel batteries of 
tests designed to measure the same ability, the investigator will, as a rule, discard any subtests which 
have a decidedly low weight for the general factor, or are likely to be more influenced than the rest 
by the conditions peculiar to a given trial. We may, therefore, now introduce this third assumption, 
which will still further simplify the resulting formulae and bring our factorization still more into 
line with the analysis of variance, namely, that, though the weights (or saturations) for different 
factors will usually be different, the weights for the same factor are the same for all the tests. 

In addition then to what may be called the simple * four factor hypothesis ’, our model will for 
the present assume (i) that the factors postulated are orthogonal, (ii) that as regards their factorial 
composition the test-batteries are equivalent, and (iii) that the saturations may differ from one factor 
to another but are the same for the same factor. In our present data there is nothing that flagrantly 
conflicts with these assumptions : but for a strict test of their accuracy a full factor analysis would be 
essential. This we shall attempt at a later stage. 

We can now reconstruct the correlations to be expected on the basis of these last two assumptions. 
As will be seen from Table IV, the hypothesis on which our model is based treats the N.W. and S.E. 


rss (viii) 


TABLE IV. POOLING SQUARE WITH EQUALIZED SATURATIONS 
Sums of Correlations in Terms of Hypothetical Saturations 


Factor Saturations Summed Correlations 


€ d - tes | 
Th d де >> | S Raa | > Rab 
d eem] = (п8)24- (па)2-- ni24- ne | = egy? 
g-dt---sg-— 
»-4-i---3] 2 Rta | & Rob 
em = (ng? nt | = (ng)2+ (nd)? 4-nt24- ne? 


1 Once again I have ventured to simplify. Burt distingui: * equi llel 
tests’. But this is unnecessary for our ULM istinguishes between * equivalent tests ’ and * рага! 


2 See Fisher, loc. cit. sup., pp. 199 f., and Tables 38 and 39. 
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M п, These can readily be computed by the device which Burt has called the * factorial analysis 
variance '—a further analysis in which the crude variances, computed for an ordinary analysis of 
the purified variances for the several 


varia, 2 н ^ 
nce, are themselves re-analysed to obtain, as it were, 


y i 1 
Pothetical factors that express the classification employed. . А 
а crude va, P рне formulae are given in the earlier memorandum already cited. Using V to denote 
the vane ance derived by an ordinary analysis of variance (as in Table 1 above) and c? to denote 
Values fj nces of the hypothetical factors, we have the following equations, and then, inserting the 
or V given in Table 1, we obtain the figures shown on the right : 


o? = (Ио Vp Vot Voto) - mn -5811 

Op = (Ин Vpio) = +1916 
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s for each subte 


must 
col be 86x8 = 688. But this was à 


Obtain 


moment. : m 

(b) 7 already summarized in Table IV we can deduce 

Ше rel aig ational Factors. From. the forma further analysis of variance and those Ба 

У the < etween the factors obtains rm] correlations? The method of deduction will be 
izing 


Sum method ”of factorizini 
memoranda referred to above. 


GS 275, and the I.I.E.C. 
2 c C. Burt, Fact the Mind, 1940, р. 27>» 
Cf. this Journal, ors ao). 62 (appendix). 
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simplified if, as Table IV suggests, we first average the intercorrelations for the several subte sts 
We can thus re-write the values in the pooling square as follows : 


Total Total 
l RE. m 1--(n—1y, а Ta зз Fea (1— Dra 
i d 1+(n—1)r, fa ka ж Feat Q1— Dia 
Total | ‘n1 +(n— 17] пг QI DE 
nr ба ws Feat (1— Vra ME NS 1+(1— Drs 
D mn . Feat (n— Dra ж d sa 1-0-0 


Or, substituting the numerical averages from Table i" 


| Total Total 

1:0000 -6888 aes 3-0664 "7727 -5811 s 2:5160 
-6888 1:0000 js 3-0664 “5811 "7727 кз 2-5160 
— — — 12-2656 — — — 10-0640 


If (as here) the tests have been standardised so that Y o? = 1-000, then the values for the factor 
variances will be given by the following equations : 


g- o2 =н = +5811, 
P = 02, Gu = 7727—5811 = 1916, 
P= 02, Rm = :6888—:5811 = 1077, 


22 = opo = 1—Feg— Pot = 1—-7727—-6888---5811 = +1196, 


It will be seen that these equations yield exactly the same values as before. They are in fact merely 
a simplification of the ordinary method of group factor analysis, adapted to fit the hypothetical 
scheme of basic, group, and specific factors that we postulated at the outset. 

Various Forms of the Reliability Coefficient. Now, as has frequently been pointed out, there are 
in practice several alternative ways of estimating the socalled reliability of a test ; and these different 
procedures are by no means equivalent. Indeed, they imply quite different definitions for what is 
meant by reliability; and much confusion would be avoided if they were given different names. 
Three main forms are commonly distinguished ; and, as Burt has Observed, they may most con- 
veniently be expressed in terms of factors such as we have just described. Slightly modifying his 


definitions to bring them into line with those of later writers, we may formulate them as follows.2 
1 E.g., J. P. Guilford, Psy 


chometric Methods (1936), .411; H. i 15 
(1950) ee ТЫК п ric Methods (1936), p. Н. Gulliksen, The Theory of Mental Tesi 


2 One of the earliest publications which uses factor-theory in a discussion of reliability j article 
by Miss B. M. D. Cast on marking English composition (Brit. J. Educ. Psychol., 1X, 1939, a ЭА: 1940, 


1 iance to Problems of Correlation ’, this J 1, V, 1952). 
lexander, like Burt, makes full use of the intra-class correlation (C "The Estimation of 


f 5 tive ‹ А ѕ raised, which is also expressed іп te 
(viz., Holzinger's bifactor theory) will be found іп L. J. Cronbach, ‘ Test Reliabil;, i 

~ da , iability : Meaning ап! 
Determination ’ (Psychometrika, XII, 1947, pp. 1-16). The algebraic expressions there ee id no! 
or the analysis of variance : and the iactors refer 
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, Definition 1. What Burt calls the * Coefficient of Precision ' is that form of reliability coefficient 
Which measures the degree to which the measurements obtained from a test or battery on one occasion 
will agree with the measurements obtained from the same test or battery on a second occasion. This 
was the original meaning intended by those early psychologists who introduced the term. Such a 
conception tacitly assumes (a) that the function tested itself remains unaltered during the interval 
of time that separates the two testings ; (5) that on the second occasion the measurements or the 
functions are themselves unaffected by the fact that the function has already been tested on a previous ` 
Occasion ; and (c) that each subtest measures the same function as the battery taken as a whole, 
i.e., that * unique ’ or * specific factors ' (other than those that express accidental influences) have been 
eliminated either experimentally or statistically. These assumptions imply that both 72 and @ will 

€ zero, or at any rate negligible, or, what amounts to the same thing, that both the interaction 
between persons and tests and the interaction between persons and occasions are non-significant. 


In that case the expression for r,,, would reduce to 


.* 的 


Tp (say) = 


—2 214,2 
ng" nop Opto 


This simple * two factor ° interpretation was implicitly accepted by Spearman when introducing his 
of the two sets of assessments. 


formula for * correcting’ validity-coefficients for the unreliability of 2 
he implied analysis is clearly brought out in the proof of Spearman's formula given by Yule, who 
at the same time indicated methods for testing the legitimacy of the main assumptions.1 , Much the 
Same assumptions are regularly made by the physicist when he assesses the precision of his measure- 
ments by repeating his observations ; but with psychological assessments the three assumptions of 
stability, of independence, and of non-specificity may each or all be gravely violated by the data 
actually obtained. This demands obvious modifications in the psychologists proceedure. 

(i) To begin with, since the two batteries are applied on two distinct occasions, it is always 
Possible that the ability may have changed, or that its manifestations on the second occasion may 
differ from its manifestations on the first, either because the conditions on each occasion are different, 
or because the previous testing may have produced practice-effects, or else because, to avoid memory 
and practice, different versions of the tests have been applied. Whenever there are any prior Ae 
for anticipating some such influence, we must re-introduce the component represented by d^. 


We then obtain 
ES 
ng : 09) 
Тр 525 . á У 
n+ nd e 
у ye б 
The effect of the additional component will obviously be to reduce the apparent reliability ; Spearman’s 
Correction formula would therefore exaggerate the ‘ true’ value of the validity coe! ER 5 
(ii) When the subtests used on the second occasion are the same as, or at least similar E lose 
used on the first, it will be natural to expect an increase in the correlations between similar subtests, 
either as a result of specific factors or possibly of practice. This will require the re-introduction of the 
component whose variance is represented by 12, a component common to both occasions. If at the 
same time we assume (as Spearman does) that the conditions on the two occasions are identical, 
the formula will be : 
s (xii) 


This time the effect of the additional component will be to increase the apparent reliability. . 
i ivi i ter- 
i deriving r,’ or rp” from the pooling square. When the in 
ие esae, dien ae " icant, tentative estimates could be deduced 


i i -signifi РЕТ 5 
fae Le ааа re thoes apie. versions of the reliability coefficient 


from a ‘ factorial analysis of variance’. 

will seldom be wanted in a systematic research where adequate data have a E 
they call for no special name or formal definition. In practice, it will nearly 3 Ai р S AS 
that borh these irrelevant disturbances may be operative, even if their effects do not seem үп [у Signi 
ficant. Consequently, the expression for the total variance, which forms the denominator o: š 


isi tly the treatment is some- 
t to subtests comprising à battery. Consequent 
what more complicated, e.g tiis basic equation contains seven terms, five of d are liem ees ums. 
On the other hand, Cronbach's equations do not explicitly contain factors for m a XE Occasit ous Ord qe 
огеоуег, whereas Burt distinguishes types of reliability coefficient in to 0 е pute SA UE 
(or more) applications, Cronbach distinguishes his types by specifying the factors regar 
“е, iati: Two Things’, Amer. J. Psychol. 
1C. E Measurement of Association between Two gs’, Я 
ху, 1904 2 3 G. D So in жк to the Theory of Statistics (1912), pp. 213-14, BD 7. 
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must allow for the presence of both additional sources of variation. But are we also to include them 
in the numerator of the ratio, or should we endeavour to eliminate them? The answer depends 
on what precisely we desire to measure. At least three different replies are possible. Hence we 
have to consider three further types of reliability coefficient and three further formulae. 

Definition 2. The * Coefficient of External Consistency ' or * Equivalence ’ measures the degree 
to which а composite test, applied on a second occasion, either in the same or in a closely parallel 
form, yields results agreeing with those of the first application, even though the conditions of testing 
may differ. If the tests are identical, we may speak of * consistency’; if not, of * equivalence E 


- - —2. 
For this purpose we evidently require to introduce / into the numerator and both 7 and (па)? into the 
denominator. The formula will therefore be that already reached (eqn. ix) when calculating а 
test-retest correlation from the pooling square, namely: 


324.1732 2 2 
ng +t поро, 


(xiii) 


2 


221231 22:2 2 2 2" 
ng^4- nd^4-t^4-€ notn Ozo ор Fe 


This is the form to which eqn. ix reduces when л is cancelled from both numerator and denominator. 
We may therefore regard it as the variance equivalent of the familiar * test-retest correlation ”with 
parallel forms. 


Substituting the values already obtained, we have 

4x'5811-4-:1916 2:5160 
4x:58112-4x-10774-:19164--1196 3:0664 
Or, using the average correlations 2 as they stand, 


le 


:8205. 


Fid 十 (2 一 DF  2:5160 
Te = SAT d 0.8205, 
1--(n—l)r,  3:0664 
Nevertheless, instructive as this coefficient may be for preliminary inquiries, it does not appear 
to furnish precisely the information we require. Our tests have been designed to measure the common 


ability, g (whose variance is designated by од) : the more specific abilities peculiar to the several 


subtests (whose variance is designated by 92) do not, as a rule, form part of the particular function 


we set out to test. We must therefore seek some way of eliminating these disturbing factors from the 
numerator. If we keep to a correlational technique, the appropriate device seems plain. Much 
the same difficulty confronted the earlier factorists ; and to eliminate the irrelevant specifics from their 
correlation matrices, they proceeded to substitute ‘reduced self-correlations’ or reduced self- 
covariances ’ for the self-correlations or observed test-variances entered іп the leading diagonal. Here 
же сап apply the same procedure either (i) to the submatrix of cross-correlations (Rab), thus eliminating 
7, or (ii) to the submatrix of intercorrelations (Raa), thus eliminating (/7--22). In ordinary factor 
analysis the values to be inserted would be reached by successive approximation ; but if (as we are 
here assuming) the saturations for the same factor do not differ greatly in magnitude, we may plausibly 
treat the average value of the self-correlations as approximately equal to the average of the inter- 
correlations. To indicate that the submatrices now have reduced values in their diagonals, I shall 
follow the usual convention, and affix an asterisk. 

By referring to Tables I and IV, the reader can readily verify for himself the fact that the cor- 


relational procedure, with these reductions, yields the same values as the variance procedure, with 
the proposed omissions. We have in fact 


Z Ra, = n(162--02, 4-02, 4-02) = 4(2.3244 十 .4308 十 .1916 十 .1196) = 12-2656; 


J Rag = п(ло5-Е по?) = 4(2.3244 十 .4308) = 11.0208 ; 
Z Ray = п(по2-+02, = 4(2.3244 十 .1916) — 10-064 ; 
2 К = (no?) = 40-3244) = 92976. 


1 The term * coefficient of consistency * w: і i 
оша асо топо cansa зс ав proposed by P. Hartog (Marks of Examiners (1936), р. xiv， 


/ г al consistency * was suggested to denot ifically 
the coefficient derived from data obtained at a single ini: 25 SUBE note more speci 
subtests, and the phrase ' coefficient of ex gle administration, i.e., from the intercorrelations of the 


1 ternal consistency " to denote the coefficie i ^ 
est nt om two 
administrations. — А ‚ ? Cf. Burt, Marks of Examiners, p. 303, eqn. xl ORE 

3 In some inquiries, particularly in the field of educational c i ction 


А 1 A ч and vocational selecti iner may 

Sol: Ss e MAE capacity and special aptitudes or knowledge. Thus, in the 98 Sunior county 

intention er TES hat AM е e spelling: composition, and arithmetic were set with the 

ar T 5 general educati: ili á P H 

abilities and attainments (г). In such cases eqn. xiii remains ше С ond NE тыран 
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A Р T 
pad ша жаншап e obtain three further coefficients. 
ote ee . T ent of Internal Consistency " measures the » whi 
SES secon] алда кзы UNS with each other, and therefore the Vu Maki qoa pese 
Se vith a battery assumed to be i i i i 
Pond غ‎ сүнүн internally identical with the first and undi 
\ e exactly the same, would i fi ped 
ee ) bi actly , would agree with the results of the first. i 
ymbolic correlation matrix (Table IV) it is evident that the theoretical xe Nes 


the above assumptions will be 
п @ +4? XR. _ 110208 
dre ^ XR 12-2656 


т Р ОИ " " 
he coefficient refers primarily to the internal consistency of the test as assessed by the results 


of a si i i 
ingle trial.! But naturally, the figure will vary somewhat with the results of different trials. 
If there is any likelihood that 


= тае effect averaged the consistency coefficient for two. 

Onthe sr E шу rey alter the internal consistency, then we must base our coefficient solely 

The ae ort B the same procedure here yields rc — -8970. 

hene sihe on ican t ue De employed when the tests have only been administered once; this 

odistinguish m ion ofa |l such formulae. However, with a single trial it is obviously impossible 

common eal Ven £^ and d? or between t^ and €” : the two former are merged into a single factor 

obtained fro е four tests and the two latter represent the specific factors peculiar to the results 
m a single test on a single occasion. Consequently the pooled variances may be re- 


written 


"8985. . ғ . (xiv) 


c 

"с 

"oc & 5 » * A . (xv) 
осор 


and thi i 
Agi i th E Burt has shown, is equivalent to the * split half ° reliability coefficient obtained 
Definition 4 ке possible splits are taken and averaged (this Journal, VI, 61). 
of an ability asses d e * Coefficient of Test Stability " indicates the degree to which the measurements 
by a second set sed by one set of tests remain the same (or would remain the same) when measured 
will be , even though the methods or conditions of testing may differ. The theoretical formula 
2 XR 2 

— XR _ 92976 у . 0. 0. i) 
m n(g-+d?)-+4 prs 12- 
represents the ihat g (the general factor as assessed by the unweighted means for the several persons) 
nearest to ex characteristic that the battery of tests was designed to measure, this equation comes 
that the ETE what most psychologists appear to have meant by reliability. It will be seen 
should b ща сап only be computed if data from at least two trials are available. The tests 
E usata so сыре, and the administration so carried out, that both d? and г“ are reduced to a 
the coefficie an ушп that has been done, the coefficient so calculated will probably approximate to 
the functi nt of internal consistency. But, unless the structure of the test and the peculiarities of 
wheth ion tested (e.g. its liability to change) are already well known, it would be rash to state 

ег that coeflicient indicates a lower or an upper bound. А | 

аб, efinition 5. The * Coefficient of Person Stability" measures the extent to which the relative 
pis "rd a given set of persons, assessed on two or ps separate daya: have iy m ga e Шш 
0 я ications or (particularly 1 e intervi 
} i M interval between the m application this the most obvious formula will be 


Variations in the conditions that obtained, For 
2 YR, 96 qnm o. 5 co! 
= 11-0208 


y’ by certain writer 
A given function о! 
portional rate: 


° is fully discussed by Bur 


sit 


£ (xvii) 
m uli Tu 
sp = Ayd X Каа 
* function stabilit 


Thi 
; 8 с 
tion Oncept is P A 
s not unlike that designated $- 
t Bene С $ 
0 ch rally offered overlook an important qualifica Rite or prol 


an 
Be in all persons at approximately the same 8 
-application Š pp. 80-92, and the use 0 


Reli ‘The d i 1 
abijj Pe determinati iability from a single 105 
ance ОГ раина а prir. J. Educ. Psychol., XV, 1 
Ra strated in detail. (Trait Variability” Psychol. Bull, XXVI | 931, pr. 218 f; en] 
И - G. B. Paulsen, * A Coefficient о rai uation’. J rit. J. Psychol, XX th se suggested. by 
Fluc virtually identical УЙШ ат of Examiners 


s2 But the explana- 
r ability may ten 
if, for example, 


tin ' The 
f analysis 


sho; Th ٤ i 
mould Ades. jee Unreliability and Func even above are Bat 
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1936 memor. natio t 
). andum for the poy are here given 


129 


Test Reliability in Terms of Factor Theory 


the I.Q. were absolutely constant for every child, the mental ages of the children tested would alter, 
and alter by different annual increments, but the spaced order of merit would remain the same from * 
year to year; the same would happen if all pupils increased in the knowledge of a given school 
subject in accordance with some constant educational quotient. This kind of change is not included 
in the foregoing concept. In actual fact, however, as numerous investigations have shown, the cor- 
relations between the I.Q.’s of the same group of pupils tend to diminish with increase in time ; 
and this implies that the changes in mental age take place more or less erratically, so that the order 
of ability does not remain the same. Hence what we require to measure is the relative stability of the 
persons rather than the stability of the function or capacity in and for itself. б 
The Coefficient of Trait Variability. By subtracting апу of the foregoing * coefficients of reli- 
ability ' from unity, we could if we wished obtain a * coefficient of unreliability ’ (or* error’). There 
is, however, one additional coefficient of unreliability that deserves special mention, namely, the * Co- 
efficient of Trait Variability’. This measures the degree to which the trait or ability fluctuates from 
one occasion to another. The amount of variation is indicated by subtracting the * Coefficient of 
Test Stability ’ (i.e.. stability of the test measurements from one day to another), not from unity, but 


from the * Coefficient of Internal Consistency ' (i.e., consistency of tests applied on the same day). 
This yields 


2 
n m 
Оро = ۰8985-۰7580 = 1405. .  .  . (хүй) 


TED aa 
n (cz 二 9) +990 


Қу) = 


It is instructive to compare this with Thouless's formula for ‘ function fluctuation ， (loc. cit., p. 332), · 


а-а) b2) 
р) = cei ba) 
en * E (ль) 


where a and b denote the two tests, and 1 and 2 the occasions on which they are applied. Itis not 
easy to express this in terms of the constants we have so far used, because with only two tests and two 
occasions the variances of the tests and still more of their differences are liable to be highly erratic. 
However, making the same assumptions as before,! it becomes very appr xximately equivalent to 


2 
g2 —— . . . 5 . . 
(о2,+-92,,)(02--02, 


© а € à وھ‎ «a ow dA 


(хх) 


(o — Voto) ( VotVort Voot мо) 


" 5 à - á . (xa) 
Vpot Voi) (V5 Vpt + Vpo— Vpro) 


which reduces to equation xx when (as here) the measurements are so standardized that 


2 x2 2 
9-02, +02, +0, = 1:00. 
In his worked example he uses the constants obtained in his article on ‘ Fact 
Variance’ (this Journal, I, p. 12) and obtains (a) with Thouless's formula "031/۰608 = -051 and (b) with 
eqn. xx a ‘038/۰617 = -062. It will be observed that eqn. xx, or its equivalent eqn. xx a, yields a value О 


zero when во is zero and a value of unity when Vp and Ир are both zero. This version of the ratio would 


therefore seem better than the correlational form, but even soi i i s 

f i form, is not quite consistent. However, * for any 
genuine assessment of the amount of instability or trend in the ability or ** function” tested, more than 
two trials are сасна ; and the foregoing formulae are accordingly generalized to meet cases that involve 
more than two trials and more than two subtests in each trial. Where the trends are linear (whether pro- 


duced by positive or negative practice effects due to the t er] 
the ordinary methods of analysis of variance can be SIDON i or by mental growth or training), 


if they are curvilinear or i lar, the line 

E eha Oed by Philpott and Burt in their investigations of Work-curves.‏ ا و 
of the article, seem indispensable, question, factorial methods, such as are used in the later part‏ 
While the present paper is passing through the press, an article by C. C. Andi ‘si ods‏ 

of Testing for Function Fluctuation’ has appeared in Brit. J. Psychol, XLVI, 1955, кр 1:198. orel 
is similar to Burt's, and from it he derives a formula for ‘ Thouless’s criterion’ in te; of * mean squares "+ 


tor Analysis and Analysis of 
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With the foregoing figures eqn. xx yields a value of 0-68 ; i imilar subte 
вові) yelas one or е ы Sues seem far too high. ic MEE cis S 

omparative Values of the Various Coefficients. It is often supposed that the diff 
or assessing reliability may be arranged in order according to the E of the tho difera a 
: instance, several writers have stated that estimates based on single trials—estimates of * internal 
consistency 一 are higher than those based on two trials—the ‘ test-retest? procedure: the former 
it is said, avoids the attenuation due to different days and different conditions, but may be unduly 
A when irrelevant influences operative on the same day affect all the subtests ; the latter is 
iminished by the varying changes the examinees may undergo in the interval between the trials ; 
and, if * equivalent forms ’ have been employed, will be still further reduced by the lack of complete 
ا‎ Other writers have drawn the opposite conclusion. Jackson, for example, compiled 
ا‎ tests of intelligence, and found in every case that estimates based on test and retest were higher 
gu those based on internal consistency (Psychometrika, УП, 159) : such a result is to be expected 
Ў En the compiler selects a heterogeneous set of problems in order to assess intelligence from different 

ng P and perhaps deliberately discards those that are likely to duplicate each other. 

inei comparison of the foregoing formulae will show that inconsistencies like these are almost 
evitable. The differences between one coefficient and another do not result from just adding yet 


another element in turn to the ‘ true variance ': sometimes one element is substituted for another in the 


Fic. 1.—Contributions of Component Variances to Total Variance. 


по? п c, n с поо 
ag па? € e 
Р | PO |ртрто 
o + : 


КУЧЕ) and others are omitted or inserted in the denominator.. In Figure 1 the magnitude of the 
ur component variances as obtained in the present experiment is illustrated by a * bar diagram '. 


the formula. Instead of what is 


Unfortunately he gives no proof, and there are at least two misprints in 
But since in all his experi- 


there printed, we should probably read (in Anderson’s notation) BFO- 7С 


ments р = 2, this simplifies to BIG Anderson's formula is thus equivalent to omitting the division by 


1617 in eqn. (Б) above, and seems to assume that the denominator in Thouless's ratio may be taken as 
approaching unity when the factors are genuinely independent, апа as counteracting the effect of their 
correlations when they are not. 

An alternative formula suggested by Burt is 


2 2 
y S Raa J Rab n 055 t pto 
n - 
9 n+ 3 Rab пе o2, n Dep, 


If the variance due to occasions is zero, this reduces to 7». „тоў „+(п—1)ву which diminishes indefinitely 
y zero if the error variance is also zero. If the 


+o fro, + "pro and becomes exactly unity 
f this * index’ would be *3710—a far 


as n increases indefinitely, and in any case becomes exactl 
Specific variance due to the tests is zero, it reduces to (pg 
if the error variance is also zero. With the present data the value о! 


more plausible figure. S. E 1 
The chief differences between r(, and Ју) arise from the addition of the error variance in the numerator 
and the disappearance of the general factor variance from the denominator. This seems plausible, since the 
АО size of the indi factor шше is quite irrelevant to the question raised which the * index 
Seeks to answer. 2 
. 1 The diagram is inte: epresent a simple and instructive piece of apparatus used for demonstra- 
tions by Burt. А long ended QW AY (similar in form to the diagram) holds a sequence of four oblong 
panels, painted white on the front and black on the back. When um сопзі епа. ше шше їп ШЕ 
numera! ы 4 » shite side is left uppermost; W' en they are included on in the 
tor (the "true variance ), the MEE art ofthe" error variance ’, they are turned over and 


d i ‘ 1 esent 
enominator (the total variance ) and Н from both, they are removed altogether. Various 


the black side js 1 ost; when they à 1 both, they are removed alto 
01 i itr the results obtained from different typ : 
Fout i panels opem P 
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From this it will at once be seen that with the present data the substitution of o, for по?, will reduce 
the value; with other data it may have the opposite effect ; and similarly with the changes in the 
denominator. In short, the equations we have reached make it obvious that no fixed order of mag- 
nitude, valid for all kinds of data, can possibly be laid down. 

Factor Analysis. The formulae which I have so far described may suffice the purposes of a 
preliminary inquiry, It must, however, be remembered that they were based on certain simplifying 
assumptions, introduced partly to permit the adoption of well recognized routine techniques, such 
as an ordinary analysis of variance, and partly to keep the working methods as short and easy as 
possible. Let us now examine the effect of removing some of the most questionable assumptions, 
and inquire how far the more elaborate modes of computation will alter the main results. А 

Throughout the foregoing calculations we have assumed that the weights (i.e., the * loadings " 
or ' saturations ?) for any one factor may be treated as equal for all the tests. This assumption, as we 
have seen, is implicit in the formulae for the analysis of variance. Yet іп practice it is seldom likely 
to be strictly true ; and it is certainly conceivable that the appropriate weights might differ so widely 
as to render the inferences hitherto drawn decidedly precarious. The matrix representing the inter- 
action between tests and persons itself, as was noted at the time, also deserves a more detailed treat- 
ment. Our simplified procedure assumed that the factors producing this interaction were merely 
isolated components specific to each test—in other words, that, apart from the figures in the diagonal, 
the correlational submatrices have unit rank.! But once again such an assumption may be quite 
unwarranted. Ifso, it becomes conceivable that our previous estimate for the reliability of the battery 
asa test of intelligence will turn out to have been unduly magnified by the inclusion of more specialized 
abilities which we had no intention of. measuring. The obvious way to meet both these difficulties 
is to subject the data to a formal factor analysis. 

In our own work we have followed the procedures recommended and used by Dr. Barakat and 
Dr. Moursy in their recent papers.2 

(a) With Bipolar Factors. The matrix of observed correlations, given above in Table II, was 
first factorized by * simple summation '. Three successive approximations (with incidental adjust- 
ments) sufficed to secure appropriate values for the reduced self-correlations. With an 8x 8 matrix, 
the observed coefficients will provide 28 degrees of freedom ; four summational factors would account 
for all but two. Four were accordingly extracted ; but the last was too small to be statistically 
significant, and seemed devoid of any assignable meaning. The saturations for the three Jargest 
are shown in Table V. 

(1) The first is a general factor, contributing over 60 per cent. to the total variance. It would 
presumably be identified, by those who prefer a summational procedure, with the general ability 
Which the tests were designed to measure. The correlation between the factor-measurements for the 
individual testees and the independent assessments for intelligence provided by the teachers amounted 


(II) The second factor contributes more than 10 per cent. to the total variance ; and was quite 
unexpected in its nature. It classifies each battery into two distinct parts. We at first supposed 
this might result from differences in the material used for the four subtests. For the first two it 
consisted of coloured blocks or insets to be manipulated by the child ; for the second two, of black- 
and-white patterns on paper. But, after securing introspections from some of the children and trying 
similar tests with students, we found that the material in which the problems were embodied had far 
less influence than the type of activity they involved : in the first two subtests the dominant process 
was apparently perceptual combination or construction ; in the other two it was perceptual dis- 
ornon : in short, the first two seemed to depend largely on mental synthesis, the latter on mental 
analysis. 

(III) The third factor reveals an obvious contrast between the results 
trials respectively. With this method of analysis it contributes rather less Ades br 
might have been anticipated, namely, 5-5 per cent. каа 


In the light of Burt's comparison between the results of an ordinary analysi i 
p sis of v: se 
ofa summational analysis (this Journal, I, 12 and 25, Tables VI and XV) We were Beer wi 
to identify the three foregoing factors with the three components which nearly always underlie the 
analysis of variance in cases such as ours, namely, those representing (1) differences between ‘ Persons’; 


1 A somewhat similar assumption is adopted by Ki i < Я 
tion of Test EU A гө оше UR, П, 1937, Bb. Beane Richardson (* The Theory of the Estima- 
2 M. K. Barakat, ' Factorial Study of Mathematical АБДУ? th; " 7 
dete тегесе отл Ra tiete computational procedures), dnd E M. Mous PR 44 perds 

e rela superiority of the student or pupil in оп Sof a es DP: 5 
correspond with his intellectual type in other Kinlis of Work. ET «резе Гогтз of lest often seemed 19 
thetic types is not entirely new. We subsequently learnt that, when using Various fori ала аур. 


test (picture-completion, analogies, matrices, etc.), Burt MAS 
contrast : (cf»* Experimental Tests of Higher Menta хла. Мооге Пад also noted a somewhat similar 


Psychol, XIX, p. 192). 1 Processes ’, J. Exp. Ped., V, p. 125, and Brit. J. Educ. 
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2)* i Я 

m. digsotibus berven Persons and Tests’, and (3) ‘Interaction between Persons and Trials’ 
described bove: аз А шүре їозау: cf. TableI). Butthe results ofthe further analysis of variance 
EERE that БЕЗ nd still more of the group factor analysis which we subsequently carried out, made 
а at this identification was only partly correct and might easily prove misleading.! z 


TABLE V. ANALYSIS BY SIMPLE SUMMATION : 
GENERAL AND BIPOLAR FACTORS i 


Tests I п ш 
First Trial 
I. Picture :924 251 +223 
2. Block -860 “371 142 
3. Maze ‘721 —:430 317 
4. Matrix “781 —:309 :213 
Second Trial 
1. Picture :877 :291 —:154 
2. Block -804 -345 —:211 
3. Maze -803 —:340 —-185 
4. Matrix :226 一 :178 一 '344 
Factor Variance 5-015 -831 437 
Contribution to Total 
Variance (per cent) | 62:7 10-4 5-5 


TABLE Vi. GROUP FACTOR ANALYSIS : BASIC, GROUP, 


AND SPECIFIC FACTORS 
Tests Basic Anal. Synth. Ist Day 2ndDay Specifics 
First Trial 
1. Picture .791 452. 008 .341 一 -045 153 
q Block -689 523 — 058 — -337 108 .312 
Maze -597  — 076 -501 A —064 qu 
4. Matrix |68 — -101 464 376 — -013 255 
pi: m 27 40) —03) 02 200 — 153 
icture | 27 392 —0) 02 ° | 
了 plock 453 431 — 008 一 :106 26 312 
4: Maze mp -2 42 10 M as 
— 24 006 387 -06 — 409 EE 2 
Е i er HG COH. . 130 
© 10г Variance | 4-094 857 808 578 493 07 494. — :070 
ONtrib ; ВЕЕ = س‎ 
Tota yton to | ag 
Varia Б " 3 6-1 0.6 24 0-9 
(percent) |512 107 101 7 
t rotations both with 
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with a blind graphical 
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evince Measure, ера 4 it), and these ' fa К alysis. 
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Without changing to * oblique’ or correlated factors, it seemed impossible to reach anything like a 
perfect ‘simple structure’: there were, however, several alternative structures which would reduce 
the variance of the general factor to about 20 or 25 per cent. of the total variance. With the ‘ quadri- 
max method ' it was reduced to 27 per cent. The reliability coefficients reached by such methods 
differed widely, and there seemed no definite reason for preferring one estimate to the other. At the 
same time there appeared to be strong grounds for rejecting all of them. Their multiplicity itself 
seemed to show that they were at once arbitrary and ambiguous. And above all, since the very 
conception of reliability implies the presence of a factor common to the various trials and the various 
tests, it seemed evident that procedures designed to abolish or diminish the general factor were 
quite unsuited to our problem. . 

We then attempted an arithmetical rotation, in accordance with the formulae and working 
methods described in this Journal (III, pp. 53 f.). We started by extracting one basic and four non- 
overlapping group factors. The lines of division between the groups were, as usual, determined by 
the arrangement of positive and negative saturations in the previous bipolarfactors. After extracting 
the five factors corresponding to the significant factors in the preliminary summational analysis, 
we found fairly large residuals still remaining in the principal diagonal of the N.E. quadrant. These 
plainly indicate the presence of what would ordinarily be called * specific factors" peculiar to each 
ofthesubtests. The repetition of the trials converts these specific factors into narrow group-factors 
or ' doublets ’. 

Accordingly, to secure a better approximation, a fresh analysis was carried out after these quasi- 
specifics had been eliminated. For this purpose the self-correlations in the N.E. quadrant were first 
“reduced * by deducting the effects of the specifics, and five group factors extracted as before. To 
obtain saturations for the overlapping group factors a rotation-matrix was then calculated by the 
usual formula (ос. cit., pp. 61 f.), and applied to the summational factor matrix. The results finally 
reached are set out in Table VI. 

As usual, it was found that the ‘ basic factor ’ accounts for rather less of the variance than the 
corresponding * general factor" obtained by the summational procedure. On the other hand, it 
correlates more highly with the teachers’ assessments for general intelligence (0-76 instead of 0-67), 
and manifestly yields a better estimate of the ability our battery was designed to measure. The next 
two factors apparently represent specialized abilities for what we have tentatively called ‘ mental 
analysis ’ and ‘ mental synthesis’ respectively—two irrelevant abilities which our battery was not 
intended to measure. The last pair of group factors represent the special conditions peculiar to one 
or the other of the two occasions on which the children were tested (including no doubt minor 
unintentional differences between the two forms of the battery). 

Comparison of the Analyses. As we have seen, if we may accept the teachers’ assessments of the 
pupils’ general intelligence as a fair overall criterion, the weighted factor measurements for the * general 
factor ’ provided by the summational analysis would possess a higher validity than the unweighted 
means of the test-scores (to which the analysis of variance refers), and those for the * basic factor ’ 
provided by the group factor analysis a validity that is higher still. 

With both methods of factorization there is clear evidence for what we could not have discovered 
from the analysis of variance, namely the presence of specialized abilities for two contrasted types of 
test, ‘ analytic’ and * synthetic’. These specialized abilities and the effects of the two different trials 
seem to be represented much more clearly and more naturally by pairs of group factors than by single 
bipolar factors. The group factor analysis moreover discloses what the summational or bipolar 
analysis could scarcely be expected to reveal, namely, the presence of quasi-specifics each peculiar to 
the two different forms of the same subtest. Now that these various supplementary factors have been 
detected by factor analysis, we could, if we wished, go on to test their significance by a more elaborate 
analysis of variance (here they are fully significant) ; and accordingly it seems plain that in future 
investigations the factor analysis might well be undertaken at the very outset. 

. In the present inquiry, with the * factorial analysis of variance ’ the component for persons con- 
tributes 58 per cent. to the total variance. With the summational analysis the general factor con- 
tributes nearly 63 per cent. With the group factor analysis the basic factor contributes only 51 per 
cent. But, since the basic factor corresponds more closely with the ability we set out to measure, the 
last figure, which is also the lowest, here provides the safest guide.1 Apparently with the analysis of 


1 Of course, had our aim been to test the combined resultant of all these abiliti i i d 
no longer hold good. The analysis thus mak: Gi these abilities, this Caucus 


liability we aim at is the reliability of the test-measurements, not ly of the ti i if 
our aim is to measure ‘ innate general ability ° i OLIM est material. . . . Thus, i 


y. Weighting the tests will introduce further differences- 
ord * test ’ is to be regarded as a shor 
These comments deserve re-quoting, 


5 by several recent critics (e.g. 
reliability’ in The Appraisal of Intelligence (1954), e ais (e.g., by А 
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